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ON THE FREQUENCY FUNCTION OF xy 

By Cecil C. Craig 

Given the distribution function of z and y, what can be said of the distribution 
of the product xy? The author has had two inquiries during the last two years, 
one from an investigator in business statistics and the other from a psychologist, 
concerning the probable error of the product of two quantities, each of known 
probable error. There seems to be very little in the literature of mathematical 
statistics on this question. 

If X and y are independent and are each distributed according to the same 
normal frequency law, it is well known that the distribution function of 

X — rrix y — f/iy 
z — 

is 

-iCo(a),> 

IT 

in which Koiz) is the Bessel function of the second kind of a purely imaginary 
argument of zero order.^ If x and y are independent and are each distributed 
according to a logarithmic normal frequency law, it has been pointed out that 
the product, (x — a) {y — h), in which a and b are the upper (or lower) limits 
of the range for x and y respectively, is distributed according to a law of the 
same type.^ In both cases the special choice of origins greatly simplifies the 
problem. 

In the present discussion it will be assumed that x and y are distributed 
normally. It will appear that the distribution of xy is a function of the 
coefficient of correlation between x and y, and of the parameters. 


_ Wl _ 
<ri 


mi 


and 


mz my 



which arc proportional to the reciprocals of the coefficients of variation. The 
chief difficulty arises when pi and ps are small so that zero values of xy occur 


‘ J. Wishart and M. S Bartlett. The Distribution of Second Order Moment Statistics 
in a Normal System; Proceedings of the Cambridge Philosophical Society, Vol. XXVIII 
(1932), pp. 455-159. 

® G N. Watson: A Treatise on the Theory of Bessel Functions; Cambridge University 
Press (1922), p 78. 

‘ P. T. Yuan : On the Logarithmic Frequency Distribution and the Seini-lpgarithmic 
Frequency Surface; Annals of Mathematical Statistics, Vol. 4 (1933), pp 46, 47. 
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for values of x and y well within their respective ranges of variation. (If pi 
and pt are large, practically one may exclude zero values of x and y from con- 
sideration. The author hopes to present an investigation of this case soon.) 
It is the object of the present paper to study the rather unusual frequency 
function that arises in this situation. It will first be assumed that x and y 
are independent (r,y = r = 0). Then it will be shown that the distribution 
fimction when r 0 is readily derived from that arrived at in the special case. 

We can find the moment generating function of xy without difficulty. We 
have, 

f f e e^dxdy 

uK J—ao «o 

(1 - cl <r\ ■ 

Setting, for convenience, 


this can be written, 

( 1 ) 


z 


xy 

(TlffJ ’ 


MM) 




This choice of variable and of parameters will be adhered to in the sequel. 

On expanding log MM) in powera of i?, we get for the semi-invariants (of 
Thiele), 


( 2 ) 


= (2fc 1) 1 pipt , fc = 0, 1, 2, • • • 

X«.. = ^^(j>\ + p\) + {2k-l)l, k= 1,2,.... 


These give for the mean and variance of xy, 

Mty = mim* 

<r*y = <r\ml -b (r\m\ erjo-* . 

For the standard semi-invariants of z (or of xy), we have, 

t. _ _ (2fc -b 1) ! pipt 

^2 • (p? -b P* -b 1) * 

_ (2ifc — 1) 1 [fc(pi -b Pa) “b ij 
' Xjr. (P?+P? + 1)* 
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Taking, 

t _ 6 pip; 

+ + 

as a measure of skewness, it is easy to verify that 

UalgfVS. 

For either pi = 0 or ps = 0, the distribution is symmetrical about its mean 
which then falls at the origin. 

For the excess or kurtosis, we have, 

t = 6 [2(pi -(- pi) + I] ^ „ 
ipi + pi + iy ^ ■ 

Thus the skewness is never great and bccomef! small with increasing pi or pj. 
The excess also becomes small with increa.sing pi or ps, but it can be very large 
for small values of these parameters, attaining its maximum of 6 for pi = p 2 = 0. 
But, as it will appear below, the distribution function always becomes infinite 
in a logarithmic manner at the origin. (We have already seen, as must ob- 
viously be the case, that moments of all orders exist.) It is to be noted, too, 
that for any given pi and pj, hk increases without limit with increasing k, and 
that the same is true of { jt+i if neither pi nor p? is zero. 

Turning now to the derivation of the actual frequency function of z, we set 

w = xy; then for any given x, y — w/x, dy = — if i > 0, and dy = — - 

X Xf 

if a: < 0. These values are substituted into <pi{y)dx dy, in which ipi{x) and 
<Pi{y) are the frequency functions of x and y respectively, and the resulting 
expression is integrated over all values of x, giving for the frequency function 
of w: 




in which. 




Again setting z = and introducing the parameters p\ and pj, this reduces to, 




( 4 ) 


Hz) 


2t 


~ hi^)] , 
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in which, 

(5) 






and ^ 2 ( 2 ) is the integral of the same function over the interval ( — «> , 0). 
Now writing, 

(6) ^ 1 ( 2 ) = / e * **’ ^ dx , 

jo a: 

we note that 






can be expanded in a Laurent series in powers of % for all values of x except zero. 

. r 

In this expansion the coefficient of x’“h r g 1, is ^ '2,(piP2 2 ), in which 

r! 


2,(«»*)-i + ?r4 + 


(piPs «)* 


+ 


(pip2 2 )® 


, ■ r + 1 ^ (r + 2)«> 2 1 ^ (r + 3)^»> 3 

((r + = (r + fc) (r + fc -• 1) • • • (r + 1)) . 

We may note parenthetically that 




^ Sr(piP5z) = f;^V VpiP* z) , 


\P2 Z/ 


in which I,(x) is the Bessel function of the first kind with a purely imaginary 
argument.* 

Z' P2 


The coefficient of x"^*, r ^ 0, is YjApm z)- 

T ! 

Setting now, 




pii+pi- 

X 

e 


we substitute this series in (6) and seek to justify the expansion it gives for 
^i(z) obtained by term by term integration. We write, 


^i(z) 


= 'Em dx+J\~^~ Zfnix) dx . 


* Watson, loo. oit., p. 77. 
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For 2 > 0, pipt > 0, the, terms of ^ fn{x) are all > 0. Then the conver- 
gence of 





fn{x) dx 


is sufficient to allow term by term integration in the first integral. In the second 
integral we observe that X) converges uniformly in every fixed interval 
1 ^ X ^ a. Then term by term, integration is permissible here if 



fnix) dx 


is convergent.® It is evident, then, that it will be sufficient to establish the 
convergence of 



** fn{x) dx . 


If either or both z < 0 or pipj < 0, it will be easily seen that the series involved 
are still absolutely convergent which is sufficient. 

Now using the definition of the Bessel function of a purely imaginary argu- 
ment of the second kind, 


KXz) 


2 V2 



e 


4t dr 


^v+l > 


it is easy to derive the relation, 


iz) = 

2 



?! _iL 
2 ~ ik' dx 

x" ' 


Remembering that K,{z) = K-,{z), we have for our expansion, 


il'i(z) = ifg -)- (pi -f- pj) z” 5^1 K\ -b {p\ -f- p|) ^ 


4- (pi 4" P*) Sa + • • • 

in which the argument for all the ^ z, and for all the Jif-fimc- 

tions is z. 


* T J. I'a Bromwich: An Introduction to the Theory of Infinite Series; Macmillan <& Co., 
London, 2nd edition (1926), pp. 496 and 500. 

* Watson, loc. cit., pp. 78 and 183. 
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But we may as well add to this the expansion of —'piiz), which may be 
written, 


i: 


*1 ^ 

! ~ 3 1 * e 


dx , 


and obtain the expansion, 


Pi+Pj 


F{z) = 


2o-Ko + (pi + Pi) ^ ^iKi + (p} + pi) iC2 + ' • ■ J > 

the convergence of which we will examine. But it must be noted that the 
terms arising from the expansion of 


Pl* + /U- 


and 


which contribute to the expansion of F(z) as just written are those of the forms, 

and 

Hence the expansion as written is valid in any case only for z > 0. For z ^ 0, 
we may write however, 


F(z) = — - — (pi, + Ps) (pi + pj) — 

+ (pi + pl) £» ii^»+ • • • J , 

in which the arguments for the 22 a^d K-functions are the same as before. 

Let us consider now the question of the convergence of (8), first in the case 
that z > 0. We set 

^ /z-iXp-i 

•'!/(»'— 1)1 ‘ 

Then from the relation, 

(9) K,-i - K,+i = - - X, ,» 

Z 

we readily derive, 

z* ^ / '2v \ 


’ Watson, loc. cit., p. 79. 
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For 3 > 0, the left hand member and c, are both > 0. Thus 


Then let 






> 0 . 


Cp+1 _ ^ + 5i.+i , 5,+i > 0 , 

and we have, 

{v + 1)'“^ ^ \ r/ ^ • 

It is evident from this that for a given a > 0, a i^o exists such that c, < 3 
for V ^ vt^. 

Further since 


the convergence sought follows for 3 > 0. Since K is an even function of z, 
it is easy to see that (8) is also convergent for 3 < 0. For z = 0, the first term 
possesses a logarithmic discontinuity at the origin. 

To calculate ordinates of F{,z) there are fairly extensive tables available in 
Watson's treatise already referred to. These tables may be readily extended 
by means of the asymptotic formula for K{z) for larger values of z, and by means 
of (9) for larger values of v. One can rapidly build up tables of ^r(®) by means 
of the easily obtained recursion formula, 


S)>-U) - 2r+l(x) + Sr+2(a:) . 

It is unfortunately true that the expansion found for Fi,z) is very slowly 
convergent for large values of pi and p2. 

At the end of this paper are shown three charts of F{z) with the tables of 
ordinates from which they were made by way of illustrating what such curves 
look like. (On the second for comparison the broken line is the normal curve 
of error.) 

For Pi = p2 = r = 0, we have simply the known result, 

F{z) = i Xo iz) . 

TC 

For Pi = 1, p2 = r = 0, the curve is symmetrical about its mean (and the 
origin). Here every 23-function is unity. 

For the case in which pi = P2 == |, r = 0, 1 first constructed tables of 23'(*) 
for z = ±0.025, ±0.05, ±0.1, and by intervals of 0.1 to ±3.0 for f = 0, 1, ■ . . , 
20. Values of 23o(^) and ^2(3;) for x = 3.2 and 3.4 were also used. Kot 
more than five terras of (8) were required to obtain values of Fiz) accurate 
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to five places of decimals. This distribution curve is skew with Mt = 0.25 
and fs , = -y . 

The curves are plotted in standard units with unit total area {a. — 
Vpi + /ij + 1). The tables of ordinate.s are given both in units of 

3=.^andofJ = ^-^'. 

(Tiffa a. 

Turning now to the case in which r 0, after some computation, we have 
for the moment generating function, 


{p,+P|— J pipj)<*’+2 
'j(l-(l + r)t>lIl+(l-r)iJ] 


um = 


%/[! - (1 + r),>] [1 + (1 - r)d] ■ 

As a check on this result, if we set r == 1 and pi = p 2 = p in it we get. 


(d) = 


Vl - 2d 


which may be readily verified to be the moment generating function of 


if X is distributed normally with mean m and variance 




To obtain the semi-invariants of z in this case, on expanding log Af,(d) iP 
powers of d, setting 

« = Pi + pj — 2pipjr , h = 2pip2 , c = 1 -f r , and d = 1 — r , 
we have, 

log MM) = ° \ (1 - cd)-> (1 + dd)-‘ 

- ^ [log (1 - cd) -t- log (1 -I- ddl] 

( 11 ) 

== f - [2 + (c^ - d«) d -H (c’ -f- d’) d* -h (c< - d‘)d® + ■ • • ] 

+ 2 ~ + (c** + ^ -f - d’) ■ J I 

from which we derive, 

K, = j [{c"“‘ - (- d)"-‘| o -j- (c" - (- d)"! 5] 


+ !c" -h (~ d)"} . 
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In particular, 

. h . c - d 

Ai.i = 2 2 — “ ^ 

^2 — /»? J_ W2 

Xj » — a H 2 — • ^ + — 2 — = Pi + P2 + 2/)ip2r + (1 + »'■) 

(13) Xa , = I [(c^ - d?) a + (c3 + (23) b] + _ d^ 

= 6 [(p? + p^) r + P 1 P 2 (1 + r*)] + 2 r (3 + r') 

Xi.* = 6 [(c3 + (23) a (c4 _ d'*) b] + 3 (c* + d^) 

= 12 (pi -j- P 2 ) (1 -f- 3 r*) 4- 24 pipaf (3 -f- r^) 6 (1 -j- 6r -)- r^) . 


Noting that 



9b _ „ _ 1 

¥ ~ ’ Jr~ ' 'dr'" 


one can easily demonstrate what seems to be a rather striking property of these 
semi-invariants, viz., 


(14) 


= n(n - 1) X 

dr 


n— 1:« ♦ 


To gain a notion of the magnitude of the skewness and excess in this case, 
we form, 


» 


and { 4 ., 

1 ^ 2 :» 



In view of the above property, 

9^3 

dr 


6 Xl - 3 XaXi 


The denominator of this fraction is always > 0. The numerator, after some 
reduction, can be written, 

6 [p{ -f- Ps — PiPj (1 — -h (pi -f- Pi) (2 — r*) 

+ (Pi + P2 “ 2) piPiT -f- 1 — r’“] . 

The first two terms taken together, the third, and the last are all obviously > 0. 
The term, 


(Pi "h P* — 2) piptT 
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has its maximum value for 1 r | = 1. But for r = 1, (15) becomes, 

Pi + p2 + PlPiiPl + Ps) + (pi — PiY t 

and for r = —1, it is, 

Pi + P 2 — PiP 2 (pi + Pa) + (pi + pa)** j 

both of which expressions are easily seen to be > 0. 

Thus (15) is always positive and the maximum value of fs,, is attained for 
r = 1, the minimum value for r = —1. These values are respectively, 

6 (pi + p2)^ + 8 - 6 (pi - pa)^ - 8 

[(pi + pa)’ + 2]^ [(pi — pa)^ + 2]5 

the absolute value of either being ^ 2\/2, which is attained in the first case 
for pi = —p 3 and in the second for pi = pa. It is seen that for high correlation 
between z and y the skewness of zy can be quite large. 

For the excess, we see that 


^4 » = 



attains a value of 12 when pi = — pa, r = 1 or when pi = Pa, r = — 1. Since 
this is such an extraordinary value it does not seem worth while to carry out 
the extended computation that seems to be required to verify one’s surmise 
that this is the maximum of the absolute value. 

Noiir, to derive the frequency function we proceed as before. We set 2 = 

triira 

and then 


in which, 


Ji(«) 


F(z) = Mz) ~ hip) , 


2ir y/l 


L=r 

'l - Jo 




dx 


and Ij(z) is the integral of the same function over the interval (— <», 0). 
We can write Ii(z) : 


2(l-rn 


+ 


25r Vl 


- 7 : 


211 


^ ?) + rw. p,)»+(p,“r P,) i] 


da; 

z 


Setting, 


vr 


= « and 


= r, 
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this becomes, 


(16) 


vr 


p-2rp,p,+ p, ^ rf 


e 


2(l-r=) 




l-K 


X 


P,-rpit 


But on -writing, 


Pi — ''Pa 

Vl -7^ 


and = R , , 

-v/T^= 


the integral in the last expression is of the same form as the 1 ^ 1 ( 2 ) in the un- 
correlated case. It is evident, then, that the distribution function of f can 
be -written, 


Vl - 


pJ-2 , P,Pj+p_, 


>• f 


2(l-r') gd-rJ)' 


Eo iRMUt) 


(17) + {Rl + Rl) ^ E, {RJi,t)m) + mt + Rl) E. {RMK2id 


+ (Rl + Rl) Eo (RiRiOR.ii^) "h ' ' ) 

and IS essentially of the form of F{z), reached when r = 0, multiplied by an 
exponential function. 

Frequency curves for xy (m standard units) are given in Fig. 1, Fig. 2 and 
Fig. 3. 



Fig 1 
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Tables op Ordinates op the Distribution Functions, F{z) and F{t) 


For pi = P 2 == 0, r = 

0 

Pi = 

1, P2 = 0, 

r = 0 

(Curve is 

symmetrical with 

respect 

(Curve is symmetrical with respect 


to origin) 



to origin) 



I— 1 

II 

b 

o' 

II 


M. 

= 0, (Tj = 

V2 

z = l 

F{2) ^ F(l) 

z 

Fiz) 

t 

Fit) 

0 1 

0.77256 

0 1 

0.58215 

0.07 

0.82328 

0.2 

.55790 

0 2 

44891 

.14 

63485 

0.3 

43887 

0.3 

37159 

•21 

52551 

0,4 

35477 

0 4 

.31736 • 

28 

44882 

0.5 

29425 

0 5 

.27593 

.35 

.,39023 

.0 6 

0.24749 

0 6 

0.24270 

0.42 

0 34323 

0.7 

.21025 

0 7 

21519 

,49 

.30432 

0 8 

17996 

0 8 

. 19193 

57 

,27143 

0,9 

.15493 

0 9 

17195 

64 

.24318 

1,0 

.13402 

1 0 

15460 

71 

21863 

1.2 

0 10138 

1.2 

0 12595 

0 85 

0.17812 

1.4 

07756 

1 4 

.10340 

0.99 

14623 

1.6 

05983 

1 6 

.08533 

1 13 

12068 

1.8 

.04645 

1.8 

.07069 

1.27 

09997 

2,0 

.03625 

2.0 

05873 

1 41 

,08306 

2.4 

0 02235 

2.4 

0.04078 

1.70 

0 05767 

2 8 

.01395 

2 8 

.02846 

1.98 

.04025 

3.2 

.00878 

3.2 

.01992 

2 26 

.02818 

3.6 

00557 

3.6 

.01397 

2,55 

01976 

4,0 

00355 

4.0 

00981 

2 83 

.01388 

4 8 

0.00146 

4 8 

0.00485 

3.39 

0.00685 

5.6 

.00061 

5 6 

.00239 

3.96 

00338 

6.4 

.00026 

6 4 

.00118 

4 53 

.00167 

7.2 

.00011 

7 2 

.00058 

5.09 

.00082 

8 0 

.00005 

8 0 

.00029 

5 66 

00040 

9.0 

0 00002 

9.0 

0 00012 

6 36 

0 00017 

10.0 

.00001 

10.0 

.00005 

7 07 

.00007 



11 0 

.00002 

7.78 

,00003 



12,0 

.00001 

8.49 

.00001 
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Pi = P2 = i ) r = 0 


M , = 0.25, 


z 


£ 

I'\i) 

-9.0 

0 00001 

-8 04 

0 00001 

-8 8 

.00002 

-7.39 

.00002 

-8 0 

0 00004 

-6 74 

0 00005 

-7 2 

00010 

-6 08 

00012 

-6 4 

00023 

-5 43 

00028 

-5 6 

00054 

-4 78 

. 00060 

-4 8 

00128 

-4 12 

00157 

-4.0 

0.00311 

-3 47 

0 00381 

-3 6 

00488 

-3,14 

00598 

-3 2 

.00709 

' -2.82 

.00942 

-2.8 

01221 

-2 49 

.01495 

-2 4 

01954 

-2 16 

02393 

-2 0 

0,03165 

-1 84 

0.03876 

-1 0 

.05213 

-1.51 

06384 

-1 2 

08809 

-1.18 

.10788 

-0,8 

15508 

-0 86 

.19066 

-0.4 

30423 

-0.53 

37259 

-0 2 

0.47388 

-0,37 

0 58036 

-0 1 

64994 

-0 28 

.79598 

0 1 

0.68106 

-0.12 

0.83409 

0 2 

61947 

-0 04 

.63619 

0 4 

0 36322 

0 12 

0.44484 

0.8 

.21768 

.45 

.26659 

1.2 

. 14230 

.78 

17427 

1 6 

.09621 

1 10 

.11783 

2.0 

.06614 

1.43 

.08100 

2 4 

0.04589 

1.76 

0.05620 

2.8 

.03201 

2.08 

.03920 

3.2 

02241 

2.41 

.02745 

3.6 

.01571 

2.74 

.01924 

4.0 

.01103 

3 06 

.01351 
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pi — P4 — 2) I" — 0 

M , = 0,25, < T . - ^ 


z 

m 

4.8 

0 00545 

5.6 

.00269 

6,4 

.00133 

7.2 

.00065 

8,0 

.00032 

8.8 

0.00016 

9,6 

.00008 

10.4 

.00004 

11,2 

.00002 

12.0 

,00001 


l 

F { 1 ) 

3 72 

0 00667 

4.36 

.00329 

5 02 

.00163 

5 67 

00080 

6,33 

.00039 

6 98 

0.00020 

7.63 

,00010 

8 29 

,,00005 

8 94 

00002 

9,59 

,00001 
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-6 -5 -4 -3 -2 -I 0 I 2 3 4 5 6 

Fig. 3 


University of Michigan. 







A NEW EXPOSITION AND CHART FOR THE PEARSON SYSTEM 
OF FREQUENCY CURVES 

By Cecil C. Craig 

In the course of some years of teaching classes in mathematical statistics, 
the author has expanded the treatment of the Pearson system of frequency 
functions begun in the Handbook of Mathematical Statistics^ into an exposition 
that he believes possesses marked advantages in unity, clarity, and elegance. 
This is accomplished by expressing the variable in standard units throughout 
and by making the two parameters 015(03 = | 3 i, 04 == (Si in Pearson’s notation) 
and 

, 2«4 - - 6 » 

i. + 3— 

fundamental in the discussion. The various formulae that arise are obtained 
directly and in a uniform manner and are relatively simple in form and easy 
to use. The criteria for the different members of the system of functions are 
expressed very simply in terms of as and S and the chart corresponding to the 
extension of the Rhind diagram given by Pearson^ takes on a strikingly simple 
form. 

Following the beginning made in the Handbook, the system of Pearson 
frequency functions are to be found among the solutions of the differential 
equation 


( 1 ) 


Idy a — t 

y dit 1^0 d" d” hi^ 


For those solutions y = /({) for which, 
Ho d- d- 


= 0 , 




* H. L. Rietz, Editor-in-Chief; Houghton-Mifflin Co., Boston (1924). See the chapter 
on Frequency Curves by H. C. Carver. 

* The notation used is that of the Handbook, loc, cit., to which reference will be fre- 
quently made. The discussion of Robert Henderson, "Frequency Curves and Moments,” 
Transactions of the Actuarial Society of America, Vol. VIII (1904), pp. 30-41, also proceeds 
along very similar lines, although Professor Carver was quite unaware of it when he wrote 
his chapter in the Handbook, The notation of the Handbook seems preferable however. 

* Karl Pearson: Mathematical Contributions to the Theory of Evolution^ XIX, Second 
Supplement to a Memoir on Skew Variation; Proo. Roy Soc., A. Vol. 216 (1916), plate 
opposite p. 456. 
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if r and s are the extremes of the range of variation for t, and for which the 
first a + 1 moments over this range exist, the recursion formula for moments, 

(2) oind + ria„_i&o -f- (n + l)Q;n&i -h (n + 2)a„+ii)2 = oin+i, 

can be derived. Then setting n = 0, 1, 2, 3 we get the following expressions 
for the parameters, a, bo, 6i, 1% in terms of as and 5: 

h = “3 

‘ 2(1 + 25) 

h - 

2(1 + 26) 

valid except when 5 = — Below note will be taken of those solutions for 
^shich the conditions imposed in deriving (2) are not satisfied. The case in 
which 5 = — ^ will be included in the discussion of the transitional types of 
functions. 

It is useful to note that 


a ~ — 


(3) 


K = 


2(1 + 26)’ 
2 + 6 


2(1 + 26) 


-2 < 6 < 2 . 


To show this, using a well-known device, we see that 

/(t) + Uy at = -4- 2Xa3 -h 


i: 


is never negative since f(t) S 0, r g t g s, for any real X. This requires that 


ag g ai . 


But 


+ 2-5114 

Q!4 ■]“ <5 Clf4 "p 

and the result follows. One consequence of this is that 6o cannot vanish for 

any Pearson frequency function possessing moments of the fourth order. 

Turning now to the integration of (1) and the development of the various 

forms of fit) that arise, it is useful to make the preliminary statements : 

1. Over the range of variation of t, we must have/(0 ^ 0. 

2. The area under curve y — f(t) over the range of variation must be finite. 
This being true then we always determine the constant of integration so 
that this area is unity. 

3. The range in each case is taken as the maximum one for which (1) and (2) 
may be secured which contains the point, t = 0. 

4. It is sufficient throughout to take ag ^ 0 since the curve for as = -k is 
only a reflection of that for as - k through the line t = 0. 


* See the Handbook, pp. 103, 104. 
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It seems best to follow the Handbook in disposing of three of the transitional 
types before proceeding to the main types of the system’ and then to the remain- 
ing transitional types. 

The discussion is planned to embody a direct and uniform method of treat- 
ment, giving simple formulae for the calculation of the parameters in terms of 
as and S in each case, and noting the salient features of each type of curve. 
The criteria for each type are expressed in terms of as and S, which for the 
whole system permit a simple graphical representation by means of the chart 
found at the end of this article. The construction of this chart is made clear 
in the deviation of the criteria. 


Transitional Type: The Nonna! Frequency Function: as = 5 = 0 


In this case (1) reduces to, 

Idy _ . 

ydt ~ 

from which 
(N) 

The range is, of course, (- <», <«) with C == (2 t)~*. On the chart, which we 
shall refer to as the (a^ 5) -diagram, we see that this function corresponds to 
but a single point. 

It may have the appearance of reasoning in a circle to use the values of the 
parameters given by (3), which were derived from (2), in solving (1) and then 
for the solution obtained examine the validity of (2). However, we may argue 
as follows: We will use the relations (3) as definitions of q, bo, bi, and b 2 in 
terms of as and 5 which are not yet defined. Using the values of a and the b's 
given by any choice of as and S, we solve (1). If the solution is such that for 
it (2) may be derived, then the relations (3) are valid when as and 8 have their 
usual meanings. For convenience let us denote the conditions for the validity 
of (2) by (A). It is obvious that conditions (A) are satisfied for 


(N) 


f(t) 



III, as ^ 0 , 8 = 0 

Transitional types < 

[X, if also a® = 4 . 

We get here (See the Handbook, loc. cit.): 

an) /(I) - M + i).’-. e-^' , 

if A = 2/as, tlie range being (—A, «>). 

It is readily verified that, since A* — 1 > — 1, conditions (A) are satisfied. 
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For A'^ > l-(i.e., for «3 < 4) the curve is bel)-shaped ; for < 1 it is J-shaped 
with an infinite ordinate at t = —A. For the bell-shaped curve the mode 
falls at t = —1/A and the mean — the mode = 1/A = «3/2. 

For A“ = 1, we have 

(X) fit) = V . 

which represents a J-shaped curve with the range ( — 1, «>). 

For A* 7^ 1, the function has been designated type III, the special case as 
type X. On the (as, 6)-chart the points corresponding to type III functions 
fall on the line 5 = 0, the type X functions being represented by a single point 
on this line. 

Turning now to the discussion of the three main types, we note that for 
5 0, 0 and that consequently the denominator on the right in (1) is 

always a quadratic which we can write in the form 

hait - ri) {t - Ti) 

in which neither ri nor rj can be zero (since ho ^ 0), and 

bi -f- 's/bi — ib(pi ~ ag — — 46(5 -(-2) — as -|- 

^ ~ 2l 2l ■ 

03 — 'n/S 

25 

Leaving aside the special case, ri = rj, to be dealt with later, we can always 
solve (1) in the form 

(6) /(O = C(< - - rt)”» 

with 

CL T\ 1 -j- 5 as 1 A* 2 5 
~ bg {n - ri)~ 5 ^s/D ® 

(6) 

ct “ Tg 1 6 Ob 1 !2 6 

^ ~ bi (rt - n) ~ 5 -y/D S 

For 5 < 0, the r’s are real and opposite in sign; for 5 > 0 and as < 45(5 + 2), 
the r’s are complex; and for 5 > 0 and ai > 45(5 A 2), the r’s are real and of 
the same sign. These three conditions with the additional condition that 
Os 5^ 0 give rise respectively to the main types of frequency functions designated 
I, IV, and VI. The points corresponding to them fall in simply determined 
areas on the (a*, 5)-chart. The boundaries of these areas, the curve, 

(2 A 35)o“ = 4(1 A 25)“ (2 A «), 
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which intersects the type I and type VI areas, and the line, 

5 = -1/2 

contain the points which correspond to the transitional types. 


MainTypel. as 0 , - 1 < 5 < 0 U 5 ^ - i (2 + 35)al 5 ^ 4(1 + 25)^ (2 + 5)] 
For da > 0, we see that 

ri < 0 < rj and that j ri j < | ra | . 


The range is taken to be (n, ra) and (5) is written 
(I) y = Cit - - 0"* • 


It is evident that the area under the curve over this interval is finite only 
when ffii + 1 > 0 and ma + 1 > 0 and that if these inequalities hold moments 
of all orders exist. In this case also conditions (j 4) are satisfied. Now 


JMl + 1 = 


7»a + 1 = 



and in the present c4se 


1 ± 


as 

Vd 


> 0 . 


Thus mi + 1 and ma + 1 are each > 0 only if 5 > — 1. On the chart, then, 
the points for 5 < —1 correspond to no frequency functions, — they fall in the 
“Impossible Area.” 

Further the type I curve will be U-shaped, J-shaped, or bell-shaped if both 
m’s are < 0, if the m’s are opposite in sign, or if both are > 0. We have 



Since for — 1 < 5 < — J, 


we see that mi < 0 {a, > 0 ) for S in this interval. For — i<3<0, mi>0 
only if 
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which leads to the condition: 

(2 + 35)a^ < 4(1 + 25)2 (2 + 5) 

Also, 


nh 




whence it is similarly seen that mj > 0 when -^ < 5 < 0, and that generally 
m2 > 0 only when 

(2 +■ 35)a? < 4(1 + 25)2 (2 + 5) . 

Thus the curve, 

(2 + 35)a’ = 4(1 + 25)2 (2 + 5) , 

being tangent to the line al = 0 at 5 = — divides the type I area on the chart 
into three parts; Above it lie the points corresponding to U-shaped curves, 
to the right of it the points corresponding to J-shaped curves, and below it 

the points corresponding to bell-shaped curves. (Note that for 5 < - - the 

o 

curves are always U-shaped.) 

Since r2 - ri > 0 and 62 ^ 0 accordingly as 5 ^ it is readily verified 
that n < a < u only for U- or bell-shaped curves. The sign of a is always 
opposite to that of aa for curves with a mode. Finally the constant is deter- 
mined by setting 


C f (t- ri}’”'(r2 — 0™ dt = 1 , 
J ri 


giving 


c = 


/s(wii -f 1, m 2 -f 1) (ra - ri)">‘+™+ 2 ' 

Main Tjrpe IV: aa 0, 5 > 0, and al < 45(5 -f 2) 
In this case we write: 


— aa , i V^ — D , . 

'■-■^ + -25 ’■ + ’*■ 


rt 


mi = 


1 -)- 5 aa ^ 1 25 


5 « 

With this notation (5) becomes 


vt 

— ^ — m, 


nit = 


Vi 


m. 
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and since, 

e t 

( a bi \ ^ tin'* b/a _ gi:{»/2 — tan'* a/b) ^ 

a -\-U/ 

the frequency function can be written, 


—V tan 


ri < + 


(IV) y = + r)* + s’*]"”® 

It is readily seen that m > 0, that v is opposite in sign to as, that 


—v tan 


can always be taken to lie between and and that the range can now 
be taken (— <», <»). 

In the previously discussed cases in which 6 g 0, if the area under the curve 
was finite moments of all orders existed. In the present case, the area and the 
first four moments are always finite but this may fail to be true of moments of 
higher orders. Tor, since 0 < & < 2, ‘ 


+ 25 
5 ^ 2 


and the integral. 



dt 


for f{t) given by (IV) will be finite for n g 4 and infinite for n = 5 if 5 ^ 1. 
In order for the n-th moment to exist we must have 


or 


2m > n + 1 


5 < 


2 


re - 3' 


Pearson designated as heterotypic those members of his system of frequency 
functions for which the eighth moment failed to exist. (In such a case the 
standard deviation of the fourth moment in samples would be infinite.) Set- 
ting re = 8, we get 5 = 2/5 as the deadline on the («J, 5)-chart. 

It was apparent that conditions (A) were satisfied for —1 < 5 < 0. (It 
will appear below that the case in which 5 = — | is no exception.) For 5 > 0 
it will be seen that it is generally true, as in the present case, that the formulae 
(2) and (3) can be derived if a „+2 exists, i.e., if 


S < 


2 


re - 1* 
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To determine 0, on setting the integral of (V) over the interval (— co, oo) 
equal to unity, we get 

^2m— I 

^ "" (?(2m - 2, v) 

in which 

G{2m — '2, v) = j sin^’"“*^e’''’ d<p ^ ^ _ tan-^ . 


Main Type VI: as 0, 5 > 0, «* > 45(5 + 2) 1(2 + 35)a3 7«^ 4(1 -f 25)2(2 + 5)] 

The conditions specify the remaining area on the chart. This may be left 
in the form 

(5) y = C{t - nr'it - 

Now Ti and rs are both opposite in sign to as, which, as usual, we will consider 
positive, and Ir^l > Irij. Always ms < 0 and mi ^ 0 accordingly as 

(2 + 3S)a^ ^ 4(1 + 25)2 (2 + 5) . 

We note that 

a — rj = ht{ri — ri)mt > 0, 
since now bi > 0, and that 

a — ri = iiin — r 2 )mi 

has the same sign as mi. Finally a < 0 . 

Thus for as > 0 and mi > 0, the point < = o on the axis of t lies to the right 
of both t = ri and t = rj. Also 


mi -|- ms = 


2(1 + 25) 
5 


-4 - 


2 

r 


The range is taken (ri, oo), the curve being bell-shaped when mi > 0. If 
mi < 0, the curve is J-shaped, t — a now lying to the left of < = ri. 

Since 

mi -h ms < —5, and mi -|- 1 >0, 

the area and the first four moments always exist. In order for the n-th moment 
to be finite, we must have 

— (mi -1- ms) > n -f 1 

which is the same condition as in the case of the type IV function, giving the 
same deadline, 5 = 2/5. 


‘ Cf : Tables for Statisticians and Biometricians, Cambridge Univ. Press, Part I, 2nd 
edition (1924), p. Ixxxi. 
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If the origin be shifted to the point, t = r,, we have writing, 
t - n- n — Ti — a, 

for the type VI function the expression, 

(VI) y = ~ 

with the range (a, oo). Finally 


+ 1, ~»»i — m 2 — 1) ■ 


Transitional T 3 T>e II: aa = 0, — 1 < 5 < 0. (5 — i) 


In this case, 


ri 



< 0 


mi = m2 


1 + 23 
5 


^ 0 


accordingly as 3 ^ — J . 


The frequency function is a special case of type I; setting, 

~ri = n = S 

mi = Tfii — M , 


we can write it in the form, 

(II) y = C(S* - IT- 


As in all cases in which a» = 0, the curve is symmetrical about the mean.* As 
in the type I case, the area and moments do not exist for 3 g — 1 ; for — 1 < 
6 < — i, the curve is U-shaped; for — ^ < 6 < 0, it is bell-shaped. The range 
is, of course, i—S,S). 

Finally, 

C 1 

" (2S)«''+i/3(M 4- 1, -I- 1) ■ 

Transitional Type VII; as = 0, 3 > 0 


This function may be regarded as a special case of type IV, with 

and 


,.o, ..V5|+l>0, ,.o, 


1 25 . « 

> 0, 


• It follows at once from the recursion formula, 

" 2 - (n”- 2) s “* 

obtained from setting the expressions (3) in (2), that on changing the sign of a,, the signs 
of all the odd monjents are changed. 
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and we write the function; 

(VII) 2/ = 

The type VII function may equally well be derived from the type II function 
by noting that 

S = is and M = —m. 

The range is (- «>, «>) however and for 5 S 2/5 the function is heterotypic. 
Finally 

^ r(m) 

- VS (72^- 


Transitional Type V; aa 0, ^ > 0, = 45(5 + 2) 


and we return to (1) to derive the form of the function, writing it: (The type V 
can also be derived as a limiting form of type VI) 

1 % _ q — t 
ydt~ bi(t + ry ' 

On integration we get 

JL a + r 

y = C{t -f- r) ‘>6 W<+r) 

^C{t + r) » e 

2 r (m— 1 J 

(V) =C(t4-r)-*”’e <+>• . 

We note that r has the same sign as aj and that wi = 2 + 1/5. The range is 
taken to be (— r, ± oo) accordingly as aj ^ 0. The curve is always bell- 
shaped. In order for the n-th moment to exist we must have as always when 
5 > 0, 

4 + 2/5 > n 4- 1 

leading to the same conclusions as in the type IV or VI case. Finally 

^ [2r(TO - 1)]“"*-' 

^ ~ r(2m - 1) ‘ 

Transitional Type VIII; a, = 0, 5 < (2 4- 35)a^ = 4(1 4- 25)^2 4- 5) 

The function is a special case of type I in which wii < 0 and Ws = 0. But 
when nil *= 0, nii = —2m, and the frequency function becomes 

(VIII) y = C(t- ri)-'" . 
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The range is (n, rj), the curve being J-shaped with an infinite ordinate at 
t ~ ri and a finite one at i = r*. In this case, 


( 7 - 


1 ~ 2m 

(r, - ■ 


(1 - 27 » > 1 ) 


Transitional Type IX; 0:3 0, < 6 < 0, (2 + 35 )a £3 == 4(1 + 25)*(2 4- S) 

We hffve another special type I function in which mi ~ 0 and mi = —2m > 0. 
The function is 

(IX) y = C(ri - 

the range still being (n, r^), the curve being J-shaped with a finite ordinate at 
t = rs. C has the same value as in the type VIII case. 


Transitional Type XI; a, 7 ^ 0, 0 < 5 < 2/5, (2 + 3S)al = 4(1 + 23)^2 + S) 

The function is a special type VI in which mi = 0, and = —2m<0, 
and we may Write it 

(XI) y^C{t~ n)-^^ 


with the range still (n, 00 ). 
t = n. Again, 


The curve is J-shaped with a finite ordinate at 


2w — 1 
(n — 



Transitional Type XII : S = — | 

If 5 = — the four linear equations derived from (2) from vhich the values 
of a, bo, bi, and 6 j in (3) are derived are inconsistent . Wc can however set the 
values (3) in the differential equation (1) and from its limiting form as 5 — v — 
derive the function appropriate to this case. 

We obtain 


1 dy — Qf3 — 2(1 -f- 25 ') t 

y dt (2 -|- 5) -|- 0 : 3 ^ M" 

and if 6 = —5, this becomes 



1 ^ 

__ 2 as 

2 a3 


y dt 

P - 2azt - 3 ~ 

(f ~ rO (t - Ti) 

with 




ri 

= as - 

- Vag -1- 3 , Vi 

= as -h ^/aj -f- 3 

On integration, 





y 

' = C'it - n)'"- (t - 

- I'a)’"’ , 



CHART FOR PEARSON ■ SYSTEM OP FREQUENCY CURVES 


27 


in which 


mj, = 7= -^- - > wij = 


— > "‘j = —7= 

"v O' j 3 a / a 

We observe that (aa > 0) 

rs > 0 > ri , 1 rj I > 1 n 

nii = —mi > 0 , 

Taking the range to be (n, rj), we write, 

(x„> 

the curve being J-shaped. Here 


3+3 


(ra' ~ n) j8(l - m 2 , 1 + W 2 ) ■ 

The values of the parameters and the form of the .function can also be derived 
as a special type I function in which 5 = — 

Finally we note that for aa = 0, (XII) reduces to 

y = C 


thus including the rectangular distribution function among the Pearson system. 

In the course of the above discussion a system of criteria for the various 
types of functions has been.set up in terms of as and b, in terms of which in 
every case the parameters may be readily calculated. The (a*, 5)-chart which 
makes these criteria visual is comparatively simple to construct and is strik- 
ingly simple in appearance. Besides the lines, 

1 2 

5 = — 1, 5 = — — , 5 = 0, a3 = 0 , 

2i 0 

it contains only the curves 

a* = 45(5 + 2) 

on which the points corresponding to the type V function lie, and the curve, 
(2 + 35)a| = 4(1 + 25)' (2 + 5) 

on which the points corresponding to the functions of types VIII, IX, X, and 
XI are found. I must take occasion to express my thanks to Mr. Simon Yang 
who constructed this chart for me. 









RANK CORRELATION AND TESTS OF SIGNIFICANCE INVOLVING 
NO ASSUMPTION OF NORMALITY*' f 

By Harold Hotelling and Margaret Richards Pabst 

1. Dependence of Tests of Significance on Normality 

The powerful tests of significance, largely the work of R. A: Fisher, which have 
been revolutionizing statistical theory and practice, are in the main based 6n 
the assumption of a normal distribution in a hypothetical population from which 
the observations are a random sample. The nature and extent of the errors 
likely to result from the application of a test of significance assumiiig normality, 
where normality does not really exist, have been the subject of investigations 
both experimental and mathematical,^ which however have not produced 
satisfactory substitutes for Fisher’s methods. A false assumption of normality 
does not usually give rise to serious errors in the interpretation of siihple means, 
since the distribution of a mean of any considerable number of cases is very 
nearly nonniil, no matter what the nature of the parent population, so long as 
it doc'.s not fall within a certain class having infinite range, and including the 
Cauchy distribution. The sampling distributions of second-order statistics are 
however more seriously disturbed by lack of normality, as is evident from their 
standard errors. For example the variance (Mt - of sample variances is 
much affected if differs considerably, as it often does, from the value 3 
which it takes for a normal distribution. Likewise the approximate variance 
of the correlation coefficient, ' 

2 ^ / I r MoiMtl , M22Mll 

0- ^ - < ftu -b - , ■+• ^ b 

^^20/^03 \ 4^2 0 Iho Mo 2 "/ijoMlB 


* Research under a grant-in-aid from the Carnegie Corporation of New York. 

t Presented to the American Mathematical Society at New York, Oct. 26, 1935. 

I J L. Carlson,! A Study of the Distribution of Means Estimated from Small Samples by the 
Method of Maximum Likelihood for Pearson's Type II Curve, Unpublished M. A. Thesis, 
Leland Stanford Junior University, 1931. 

Leone Chesire, Elena Oldis and Egon S. Pearson, Further Experiments on the Sampling 
Distribution of the Correlation Coefficient, Journal of the American Statistical Association, 
June, 1932, pp. 121-128. 

Victor Perlo, On the Distribution of Student’s Ratio for Samples of Three Drawn from a 
Rectangular Distribution, Biometrika, Vol. XXV, Parts I and 11, May, 1933, pp 203-204. 

Paul R. Rider, On the Distribution of the Ratio of Mean to Standard Deviation in Small 
Samples from Non-Normal Universes, Biometrika, Vol. XXI, Parts I to IV, December, 1929, 
pp. 124-143. 

H L. Rietz, Note on the Distribution of the Standard Deviation, etc., Biometrika, Vol. 
XXIII, 1931, pp. 424-426. 

W. A Shewhart and P. W. Winters, Small Samples— New Experimental Results, Bell 
Telephone Laboratories, Reprint B-327, July, 1928. 
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where Pi, is the mean value of z'y^, and >iio = moi = 0, may be substantially 
different from the value (1 ~ p®)Vw commonly used, to w'hich it reduces if the 
population has the bivariate normal distribution. It is however remarkable 
that if the variates are really independent, so that nn — 0 and nn = msoMos; this 
formula reduces to 


regardless of the form of the distribution. It should of course be remembered 
that these formulae give only the first term of an expansion in inverse powers of 
n, and also that the standard error fails for small samples to characterize the 
distribution adequately. But the sensitiveness of the standard error formula to 
deviations from normality in the population is a symptom of the grave dangers 
in using even those distributions which for normal populations are accurate, 
in the absence of definite evidence of normality. 

To substitute in standard error formulae values of the higher moments esti- 
mated from the data does not meet the difficulty satisfactorily, since these higher 
moments are themselves subject to sampling errors which are often large, and 
since no exact distributions can ever be obtained in this way. The use of an 
arbitrary system of distributions such as the Pearson curves is subject to the 
same criticisms as that of the normal distribution. These and other special 
distributions may indeed he justified in special cases by general reasoning; an 
example of this in introducing a measure of relationship other than the correlation 
coefficient is to be found in the genetic discussion of Chapter 9 of Pisher’s “Sta- 
tistical Methods for Research Workers." But for a great deal of statistical work 
no such a priori reasoning is available and sufficient to specify a distribution in 
sufficient detail. If a specific form of distribution other than the normal can be 
relied on in a particular case, the mathematical problem of finding the exact 
distribution of the appropriate statistic will still commonly be found difficult or 
impossible. 


2. Tests Independent of Normality Assumptions 

A set of problems is thus encountered regarding the nature and methods of 
statistical inference possible without assuming any particular distribution of the 
variates in the population from which we have a sample. Tests of significance 
underlying such inferences must clearly be invariant under all transformations 
of each variate. We are thus forced to rely for our information on relations of 
order, or of qualitative classification, rather than upon magnitudes, excepting 
insofar as we can use inequalities such as that of Tchebycheff. Classification 
leads to the use of contingency tables, from which accurate probabilities are 
calculable for testing whether or not the two or more principles of cross-classifica- 
tion used are independent. If the probability obtained is so small as to render 
it incredible that independence exists, the further problem arises of measuring 
the degree of relationship ; but in the absence of special assumptions, such as that 
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of the bivariate normal distribution, or those in Fisher’s genetic example men- 
tioned above, the problem of measuring degree of relationship is insoluble. Any 
measure of degree of relationship will change its value, unless this value corre- 
sponds to independence, when transformations other than those of a restricted 
class are applied to one of the variates. The problem of measuring degree of 
relationship, or correlation, is thus of quite a different character from that of 
testing the existence of a relationship, which is equivalent to absence of inde- 
pendence. The existence of correlation may be detected by methods of rank 
order or of classification; these can never, by themselves, be sufficient for its 
measurement. 

To test the deviation of the center of a symmetrical population from some 
definite hypothetical value, Student’s distribution, which is appropriate when 
the population is normal, may be replaced by the binomial distribution, which 
will sometimes show that the preponderance of cases on one side of the hypotheti- 
cal value is too great to admit the hypothesis. Fisher applied this principle to 
Student's original example, showing at the same time that it can in certain eases 
be used to test the significance of the difference between the means of two 
samples.*® Both this type of test and the use of contingency tables with grouped 
values of variates bring out clearly the fact that abandonment of the assumption 
of normality is equivalent to a certain loss of information, larger samples being 
required to make up for the lack of knowledge of the form of the population. 
The loss of information is greater for contingency tables arranged according to 
the values of the variates than when an appropriate method of rank correlation 
IS used, for the contingency table may be regarded as derived from the ranks by 
grouping them, thus discarding some of the information. 

We shall in §8 illustrate a combination of rank and contingency methods 
suitable for utilizing simultaneously two kinds of information contained in 
grouped data 

For large samples a method of treatment for which a great deal is to be said in 
many cases consists of replacing the observed variate by a new variate x to which 
a value is assigned for each individual or frequency class by interpolation in a 
table of the normal probability integral, in such a way that the distribution of x 
m the sample approximates normality. If this is done for each of two variates 
which do not have the bivariate normal distribution, the transformed values x 
and y may also lack the bivariate normal distribution, even approximately, 
though each is normally distributed, so far as we can speak of a sample as being 
normally distributed. Even if the bivariate distribution is normal, the correla- 
tion coefficient of x and y will not have the same distribution as the correlation 
coefficient in samples drawn from a bivariate normal distribution, since in the 
latter case the distributions of x and y separately would in most samples be less 
nearly normal than when the transformation to^ approximate normality is 
applied. From these considerations it follows that for the detection of correla- 
tion the normalizing transformation cannot be said in general to be the best 

* R. A Fisher, Statistical Methods for Research Workers, Art. 24, end. 
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method, even for large samples, though it may be a useful preliminary to the 
application of the method of least squares or to the use of correlation coefficients 
significantly different from zero in certain cases. 

3. The Raolc Correlation Coefficient 

Suppose that n individuals are arranged in two orders with respect to two 
different attributes. Thus we might arrange a freshman class in order according 
to their grades in a language examination, and also according to their mathemat- 
ical grades. As another example, we might be able to obtain ratings of various 
states with respect to penal law or practice, and also with respect to amount of 
crime. Continuous variates expressing these qualities are likely not to be nor- 
mally distributed, so that the product-moment correlation coefficient r cannot 
be expected to have the exact distribution known, for it in the case of samples 
from a normal population. We may therefore resort to the ranks, ignoring any 
exact values that have been assigned. 

Calling Xi the rank of the ith individual with respect to one attribute, and ‘7 < 
his rank with respect to the other, so that {X\, X^, •• • , X„) and (7i, 7s, • • • , 
7n) are two permutations of the numbers (1, 2, • • • , n), let us put X( = Xi ~ 
y, = Y, ~ y, where 


The rank correlation coefficient is defined as 


( 2 ) 

the sums being over the n values in the sample. Now since the sum of the first 
n integers is n(n -f l)/2, and the sum of their squares is n(n -f l)(2n l)/6, 

we have 

Sx* = S (X - »)* = S X* - (S X)Vn 

_ n(n -f 1) (2n 1) n(n 1)* n’ — n 

“6 4 1^’ 

and Sy* has the same value. Also, if we put d,- for the difference between the 
two ranks for the rth individual, so that 

d{ = X, — 7, = Is — j/S) 

we have 


2d^ = Si* - 2Siy + Sy* = ~22xy. 

Substituting in (2) the valqe of Siy found from this equation, and also the values 
just obtained for Si* and Sy*, we have: 



HANK CORRELATIorr AND TESTS OF SIGNIFICANCE 


33 


(4) 


1 - 


v? — n‘ 


This is the most convenient formula for computing r' . 

Compared with certain other tests of correlation based on order, such as S j d |, 
or the number of inversions required to pass from one permutation of the -n 
numbers to the other, r' appears to be a sensitive index of relationship, since for 
a given value of n it possesses a greater number of distinct values. But to 
assert without qualification that r' or any other statistic is the best possible test 
of correlation based on order relations alone would be meaningless. Indeed, a 
particular type of bivariate distribution might well have a parameter represent- 
ing correlation whose significance could best be detected by a test adapted only 
to this particular bivariate distribution. However the rank correlation coeffi- 
cient has properties that point to its value in more general use than it has hereto- 
fore received. It has been regarded chiefly as a more easily calculable substitute 
for the product-moment coefficient r. Karl Pearson has remarked that the rank 
correlation coefficient is the easier to compute for samples smaller than approxi- 
mately forty, while r involves less labor for larger samples. 

The great value of the rank correlation coefficient appears to us to consist in 
its use as a test of the existence of correlation, a test capable of exact interpreta- 
tion in terms of probability, without any assumption of a normal or other special 
bivariate distribution. If a bivariate distribution is specified by /(a:, ?/) d% dy, 
the condition of independence is that /(a:, y) shall be the product of a function of 
x by a function of y. If we put 

(5) ^ = f [ f{z', y') dx' dy' , ,] = f f f(x', y') dx' dy' , 

J-kJo Jo 

using the inner integral sign in ehch case to correspond to the inner differential, 
then each of the quantities $ and v is distributed with uniform density from 
— j to -f- 1 ; and if x and y are independent, then ^ and rj are also independent. 
The correlation p' of ^ with »; may be called the rank correlation of x and y 
in the population. It will vanish in case of independence. It is for this case 
that we shall obtain in §§5, 6 and 7 the exact probability test for r' in small 
samples, the exact standard error and fourth moment, and asymptotic values for 
the higher moments, with a demonstration that, for sufficiently large samples, 
r' can be treated as normally distributed. In §9 we shall present, m a revised 
and simplified form, certain work of Karl Pearson relative to the estimation of 
the correlation p in a bivariate normal distribution, and apply the results to 
discuss the question of the importance of the lost information when measure- 
ments are replaced by ranks, 

4. History of Rank Correlation Theory 

Rank correlation seems to have had its origin in the method of representing 
the distribution of a variate by grades or percentiles introduced by Francis 
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Galton.® Later Spearman* proposed that rank be considered in place of the 
variate, and suggested that the correlation of ranks be used as a measure of the 
degree of dependence of the variates. Spearman also introduced the “footrule 
of correlation” based on 2 1 d j. 

The principal memoir on rank correlation is by Karl Pearson.^ Assuming an 
underlying normal distribution, Pearson obtains a relation equivalent to 

(6) P = 2 sin g p' , 

where p is the correlation of x and y in the population, and p' is the correlation of 
uniformized variates ^ and rj defined by (4). An estimate r" of p may be based 
on the rank correlation r', in accordance with (6) , by writing 

(7) r" = 2 sin g r' . 

Pearson finds the first few terms of infinite series giving the standard errors of 
r' and r". He deals similarly with the estimation of correlation by means of 
2 ) d 1 . The paper contains a neat proof, attributed to Student, of the probable 
error of r' under conditions of independence. It was this proof that sug- 
gested the analysis of §§6 and 7 below. This long memoir is very difficult to 
read and interpret accurately, owing chiefly to the failure to distinguish clearly 
between sample and population. 

The use of the probable error formulae is valid only if the distributions of r' 
and r" are sensibly normal. The question of approximate normality thus raised 
is investigated for the first time in the present paper. In order to use these 
formulae it is necessary to assume not only (1) that the underlying population 
h'as the bivariate normal distribution (an assumption which requires more than 
that each variate be normally distributed), (2) that the first few terms of the 
infinite series are enough, and (3) that the distributions of r' and r" are practi- 
cally normal, but also (4) that sample values can be put for population values 
in the formulae, or that population values are known independently or can be 
assumed. It is probably this last condition that has been least understood and 
has led to the greatest number of false conclusions regarding the significance of 
data. 

A note by W. C. Eells* presents a compilation of numerous textbook versions 
of the probable errors of r' and r", all differing from each other and from Pear- 

‘ Francis Galton, Natural Inheritance, Macmillan, 1889, Chaps. 4 and 5.' 

* C, Spearman, The Proof and Measurement of Association Between Two Things, American 
Journal of Psychology, Vol. 15, 1904. 

‘Karl Pearson, On Further Methods of Determining Correlation, Drapers’ Company 
Research Memoirs, Biometric Series IV, Mathematical Contributions to the Theory of 
Evolution, XVI, London, Dulau, 1907. 

* W. C. Fells, Formulas for Prohdble Errors of Coefficients of Correlation, Journal of the 
American Statistical Association, Vol. 24, 1929, p. 170. 
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son s. Taking Pearson s formulae as correct, without discussing the assump- 
tions implicit in their use, Eells presents a table for calculating the probable 
errors of r, r' and r" . 

S. Significance of Rank Correlation in Small Samples 

If the variates are independent we may without loss of generality assign the 
values 1,2, • ■ • , n in order to Zi, X 2 , • • ■ , X„, and regard the F’s as made up 
by any one of the n! permutations of these numbers, all permutations bemg 
equally probable. The probability of any particular value of r' is thus proporr 
tional to the number of permutations giving rise to this value. These may be 
enumerated with the help of (4). Thus for n = 2, each of the values ± 1 has 
the probability For n = 3, the possible values of r' are -1, i 1, with 
respective probabilities 1/6, 1/3, 1/3, 1/6. Tor n = 4 the values 1, 4/6, 3/5, 
2/5, 1/5, 0 have the respective probabilities 1/24, 1/8, 1/24, 1/6, 1/12, 1/12. 

From (2) it is evident that the distribution of r' in case of independence is 
symmetrical, since each permutation is exactly as probable as that of directly 
opposite order, and since a change of sign of all the x’s or y’s changes the sign of 
r' without affecting its absolute value. It is clear also that the values r' = ±1, 
corresponding to the two variates being in the same or opposite orders, are the 
extreme ones, and have each a probability 1/n!. The next greatest value of 
1 t' 1 corresponds to the interchange of two consecutive individuals, who may be 
selected in n — 1 ways and makes = 2 Thus the values ±(1 — 12/[n® — n]) 

occur with probability (n — l)/n! each. Next to these, corresponding to 
= 4, are the values ±(1 — 24/[n’ — n]), whose probabilities are each 
(n — 2)(n — 3)/2(n!), since the numbers of pairs of mutually exclusive consec- 
utive pairs in a sequence of n is (n — 2)(n — 3)/2. In like manner, but with 
greater complexity, it appears that the probability of the value 1 — 36/[«® — n] 

is ~ ~ ^ ~ ~ Easy calculation from these results 

shows that, if we require for significance a probability P = .01 of a value of 1 r' j 
as great as or greater than the value observed, then for samples of 5 it is 
impossible to obtain a significant value; for n = 6, significance requires that 
r' = ±1; and for n = 7 the significant values of ] r' j are 25/28 and more. For 
the less stringent standard P = .05, a unit correlation only is significant in a 
sample of 5; while 29/35 is not, but 31/35 is, significant in a sample of 6. 

6. The Standard Error and Fourth Moment 

For large samples the exact calculation of probabilities becomes very laborious, 
and we are forced to resort to approximations. The first step in the available 
approximations is the determination of the standard deviation of the distribu- 
tion. The square of this quantity, the second moment or variance of r\ may, 
since the mean value of r' in case of independence is zero, be written 

c\, = n, = Er'\ 
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the symbol E denotmg the expectation or mean value of the quantity followmg. 
The operation E has the properties that the expectation of a sura is the sura of 
the expectations of the terms, the expectation of the product of independent 
variates is the product of their expectations, and the expectation of the product 
of a constant by a variate is the product of the constant by the expectation of 
the variate. It is particularly to be noted that the first of these properties holds 
whether the terms of the sum are mutually independent or not. 

From (2) and (3) we have 

( 8 ) 

n® — n 


Now we may regard Xi, Xi, as taking the same values in all samples, these 

values bemg centered at zero and differing consecutively by unity. The y’a are 
then variates, not independent of each other, taking this same set of values, but 
in a manner varying from sample to sample by chance. ■ For any particular y, 
for example that associated with Xi, the chance distribution has moments of the 
form 


( 9 ) 


Ey’- = 





£p 

n 


f 


if we denote by Sp the sum of the pth powers of the n numbers differing consecu- 
tively by unity and centered at zero. It is clear that, for every odd value of p, 
Sp — 0. Also, from (3), 

— n 

s, ^ . 

In view of these facts, we have from (8), 

-2 _ _ E{'Zxy} _ 'Lx^Ey'^' -|- 2^ XiX^Eyiyt 

a,, - xir , 

^2 §2 

where 2210:2 stands for the sum of all the n{n — l)/2 different terms obtained by 
permuting the subscripts. We have 


also 


EyiVi = 


2221X2 
n{n — 1) 


t 


2 2 X 1 X 2 — Sj — $2 “ — * 

Combining these results we have: 


( 10 ) 


1/s^ I 1 

n(n - 1)/ n - 1‘ 


This is the formula obtained by Student and incorporated in Pearson’s memoir. 
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Any desired moment of r' may be obtained in this manner. However the 
complexity oi' the calculation increases rapidly with the order of the moment, 
and the derivation of even the fourth moment is too long to be included in this 
paper. The value obtained for the fourth moment is 

3(25n^ - 13# - 73# + 37n + 72) 

25n(n + l)Hn - 1)3 


It will be observed immediately that the kurtosis, approaches the 

normal value 3 as n increases. 

For values of n which are not small enough for the exact probabilities to be 
computed easily, the TchebychefE inequality. 


( 11 ) 


P ^ 


1 

(n — 1) r'^ ’ 


where P is the probability of a deviation exceeding r', will often be of service. 
Thus, if n = 25 and r' = .9, (11) shows that P is less than .05, so that the 
evidence for existence of a relationship should by an ordinary standard be re- 
garded as significant. However this does not in general give an accurate 
approximation to P, nor do the similar inequalities involving the higher moments. 


7. The Higher Moments and the Approach to Normality 

A general moment of r' of even order is defined by 

(12) ^2a = E + ^21/2 + • • • + ^nyn)-” • 

^2 


When the parenthesis is expanded we may take the expectation term by term, 
regarding the z’s as constants. Now 


Eyl’^ = 


S xl“ 


EyV 


Vi 


n (n — 1) ’ 


and so forth, the sums on the right in the numerators being symmetric functions 
of the constants x, taken over all different terms obtamed from that written by 
permuting subscripts, and the denominator being in each case the number of 
terms in the numerator. Thus 


(13) 


Ilia = 


1 /(2xJ“)3 


82 “ 


n 




(2xf“~* X2)2 
n (« — 1) 


+ B 


(Sxi“ ^ XjXi)^ 

n (n — 1) (n — 2) 


+ 


where the coefficients A, B, ■ • • depend on a but not on n. With a view to 
determining the leading term in the expansion of Msa in powers of n~3, we shall 
select the term in the curly brackets in (13) of highest degree, meaning by the 
degree of one of these rational fractions the excess of the degree of the numerator 
over that of the denominator. 

The symmetric functions are well known to be expressible as polynomials in 
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the power-sums Sp. In each term of ■ ..ch a polynomial corresponding to one of 
our symmetric function of degree 2a, the sum of the subscripts of the Sp’s must 
be 2a, since if all the x’s are multiplied by a constant such a polynomial must be 
multiplied by the 2ath power of the constant. Now Sp is a polynomial of degree 
p + 1 m 71 , if n is even, but vanishes identically if ti is odd. Consequently the 
degree in n of any of the terms of the polynomial in the power-sums must exceed 
2a by the number of power-sums appearing in this term. Therefore, the term of 
highest degree in n obtained, when one of the symmetric functions is expressed 
in terms of the Sp’s and thence in terms of n, must contain the greatest possible 
number of the Sp’s. If p is the number of distinct x’s in a term of one of our 
symmetric functions, this function may be written in the form 

2Xi’X 2 • • * “ CoSa,S(,, * • • Sop—i ^ap ClSpi^^ip Sa, • • * Sop — , 

(14-) C2Sa,Sa,.fap * * * Siip_, ...... Cp^lSa,Sa, ’ * ■ Sap— i+Op 

" C ^a|4.af4'Op i “ * • • j ^ 

where Oi -f a* -f • • • -|- Op = 2a, and the c's do not involve n. In the right-hand 
member of the equation above, the first term involves p of the power-sums, while 
the remaining terms involve fewer of them. Hence, if all the indices ai, Oj, • . . , Op 
are even, the first term is a polynomial of degree 2a -f p in n, while the remain- 
ing terms are polynomials of lower degree, and are therefore negligible in compari- 
son with the first term when n is sufficiently large. But if any of the indices a< 
are odd, the first term vanishes identically, and the degree of (14), regarded as a 
polynomial in n, is then less than 2a -}- p. Smce the sum of the indices is 2a, the 
number of odd ones among them must be even; let this number be denoted by 2g, 
and let the number of even indices be m. Then p = m + 2g. The terms of 
highest degree in the right-hand member of (14) mpst be obtained by grouping 
the odd indices in pairs to form the subscripts of the s’s. The degree is therefore 
2a -j- m + g. 

In (13), the degree of the denominator of each term in the curly brackets is the 
number of distinct x’s appearing in a t§rm of the symmetric function in the 
numerator, namely p, or tti 2g. Hence the excess of the degree of the numera- 
tor over that of the denominator is 

2 (2a -f- 771 -f- g) — (m -|- 2?) = 4a + m . 

This will be a maximum when tti is a maximum, and is independent of g. The 
maximum value of tti is a, and occurs only for the sjunmetric function 

(15) S x’x? • . . x^ 

The term involving this function is therefore the only one in the right-hand 
member of (13) we need consider. Since this symmetric function contains 
n(n — l)(n 2) ■ ■ ■ (n ~ a -f- 1)/ (a!) terms, and since in the expansion of 

(XiPi -f XjT/j -I -f- X„J/n)““ 
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the coefficient of x\xl • • • xly\y\ •••y* is, by the multinomial theorem 
(2a) !/2", we have from (13), 

.. 1 (2a)! a! (S a:Jx 2 • • • a:®)2 

s^“ 2“ • 

To evaluate the symmetric function (15), so far as the term of highest order in, 
n is concerned, we of course need only the first term of (14), which reduces in 
this case to 

2 xlxl . . . a:* = CflS? - . . - . 

In the expansion of «“ = (a:? -f a:’ + • • • a:*)“, the coefficient of (15) is al, 

which is therefore the reciprocal of co. Thus we obtain 



the terms dropped being of higher order in n“*. 

■T he 2ath moment of the quotient of r' by its standard error, that is, of 
r' Vn — 1, is (n — 1)“ times that of r', and therefore approaches, as n increases, 
the value 


( 16 ) 


(2a)! 
a! 2“' 


The odd moments are all zero because of the symmetry of the distribution of r'. 
But (16) is the moment of order 2a of a normal distribution of unit variance and 
zero mean. It follows therefore from the Second Limit Theorem of Probability^ 
that the distribution tends to normality as n increases; that is, for an y real 
number X, the limit as n tends to infinity of the probability that r' s/n— 1 -CXis 



The normality of the limiting distribution of the rank correlation coefficient 
is rather remarkable, since r', unlike the product-moment correlation coefficient 
r and other statistics in common use, is neither a mean of independent quantities 
nor a function of such means, so that the ultimate normality just established is 
not a corollary of known general theorems. It is unexpected also because the 
exact distribution of r' for samples smaller than six might lead one to anticipate 
a bimodal distribution. 

An outstanding problem is to determine whether , the distribution of r' in 
samples from a bivariate normal distribution for which p 9 ^ Q converges to nor- 
mality. Without such an approach to normality, the probable error formulae 


’’ First proved by Markoff. Cf. Fr6ohet and Shohat, A Proof of the Generalited Second 
Limit Theorem in the Theory of Probability, Transactions of the, American Mathematical 
Society, Vol. 33, 1932, pp. 533-543. 
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discovered by Pearson are useless. Another problem is to find convenient and 
accurate approxunations to the distribution of r', for moderate values of n, with 
close limits of error. A table calculated along the lines suggested m §5 would 
be very useful. 

8. Combination of Rank and Contingency Methods 

Suppose that a thousand school children are examined at the end of a course 
of instruction, and rated with the grades A, B, C and D Five hundred of 
the.sG children are of each sex The results are; 



A 

B 

C 

D 

Totals 

Boys 

190 

200 

80 

30 

500 

Girls. . 

220 

200 

60 

20 

500 

Totals. . . . 

410 

400 


60 

1000 

Proportion of Girls 

537 

500 

.429 

400 

500 


Regarding this as a 2 x 4 contingency table with three degrees of freedom, we 
calculate = 7 52, the probability of which value bemg exceeded by chance 
is .0570 The indications of a significant difference in distribution of grades 
between sexes may thus, if one holds to the .05 standard and uses only the x* 
test, be regarded as not quite significant. There is, however, additional evidence 
in the fact that the proportion of girls diminishes steadily as we pass down the 
scale of grades. If we treat excellence in the subject as one variate and the 
proportion of girls in a group as another, we have a rank correlation of unity, 
with a sample of four. The probability of a correlation of ±1 is .083, which 
also, by itself, would not be considered significant. But we may combine the 
two pieces of evidence by the method given by Fisher.® The process consists 
of adding the natural logarithms of the two probabilities, doubling, and treating 
the result as having the x’ distribution with four degrees of freedom. This gives 
a probability in the neighborhood of .03, which would be judged significant. 

Similar cases are very common. The value of x^ is unchanged if the columns 
are permuted m any way, whereas r' depends solely on which of the possible 
permutations actually exists. Thus the two tests are independent, a property 
needed for the combination by the above method. 

9. Efficiency of Replacement of Measures by Ranks, and the Estimation of 
p from Rank Correlation, for a Normal Population 

Consider a population with a normal distribution in two variates x and y, 
each of which we shall without loss of generality assume to be of unit variance 
and zero mean. The density distribution is then specified by z dx dy, where 

' R. A. Fisher, Hlaii&lical Methods for Research Workers, 4th and 5th editions. Art. 21 1. 
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(17) 2 = 

27r Vl - p" 

where p is the correlation of x and y, or the variate correlation. By ^ and 17, 
as in §3, we denote the uniformized variates defined by (5), i.e,, functions re- 
spectively of X and y having distributions of uniform density from — | to -f- 
Then ^ and t) will each have the variance 1/12. The rank correlation p' in the 
population is the correlation of { and v’, consequently 

(18) p' = 12 r I ^rjzdxdy . 

J— «o y~eo 


Thus p' is a function of p, which obviously vanishes when p = 0. 
From (17) the identity 


(19) 


dp dx dy 


is readily calculated. With its help we have from (18) and integrations by parts, 


(20) 


dp' 

dp 


r - /.I I L L ^ ST, * 

.12 r 

y— ,09 J—eo dx dy 


dy 


Now since x and y are normally distributed with unit variance and zero means, 
the uniformized variates (5) take the form 


Therefore 



_ M 

e ^ dt, 


dx -v/27r 


— 

2 

} 



» dt. 


^ = l_c~ 

dy VZir 


si 

2 


Substituting these values and (17) in the 

dp' 12 r 
dp \/l — p* J-K J-<a 


last integral in (20) we have, 

(2 — 2 f*)y* 

2(j-p*) dxdy. 


The double integral, as is well known, equals tt divided by the square root of the 
discriminant of the quadratic form in the exponent. This gives 
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Therefore, since p' vanishes with p, 


S. 



or 



This is essentially the process used by Pearson. 

The last equation suggests that an estimate r" of p be based on the rank 
correlation r' by means of the relation 

r = 2 sin . 

o 

Prefixing a 5 to denote a deviation of sample from population value we have by a 
Taylor expansion, 

Jr" = ^ cos ^ ar' + • • • , 


the terms dropped being of higher order in 5r' than those written, and conse- 
quently of higher order in n-K Squaring, taking the expectation, and ignoring 
the terms of higher order, we have for the case p = p' = 0, by (10), 



P(5r")2 



9(n - 1) ’ 


approximately. 

The last result enables us to measure the loss of information, at least for large 
samples, that results from neglecting the exact values of the variates and using 
only ranks. The product-moment correlation coeflficient r has, if p = 0, the 
exact variance 


1 

n - 1 ’ 

the ratio of which to <7 tends as n increases to 9/r^. Thus the efficiency of the 
rank correlation method in estimating p, if p is sw^lly zero, is O/tt® = .9119. 
This means that the product-moment correlation is approximately as sensitive 
a test of the existence of a relationship in a normally distributed population with 
91 cases as the rank correlation with 100 cases. 

The efficiency of r' will of course be different for non-normal populations, and 
also for normal populations with p ^ 0. But if the form of the population is 
known, this knowledge may always be used to supplement the ranks to obtain a 
more accurate estimate of correlation, or test of relationship. This fact deserves 
some attention, since a superficial observation of the coincidence of the formula (1) 



RANK CORRELATION AND TESTS OF SIGNIFICANCE 


43 


for the leading term of the variance of an arbitrary uncorrelated population, 
and the leading term of the formula (10) for the variance of the rank correlation, 
might suggest that r' is as accurate as r. But it may be surmised that the 9 % 
loss of information found for the bivariate noripal distribution is the greatest loss 
of information in using r' in place of r to test for independence, since for non- 
normal populations the most efficient estimate of the correlation will not usually 
be r, but a more complicated function of the observations. Certainly where 
there is complete absence of knowledge of the form of the bivariate distribution, 
and especially if it is believed not to be normal, the rank correlation coefficient 
is to be strongly recommended as a means of testing the existence of relationship. 

Columbia University. 



THE ELIMINATION OF PERPETUAL CALENDARS 

By John L Roberts 

If we wish to find the day of the week for any date, one way to solve the 
problem is to use a perpetual calendar. Another way to solve the problem is 
to calculate the day of the week by mathematical methods In the past these 
mathematical methods have been so complicated that it has been much more 
convenient to use a perpetual calendar. This explains why some people have 
put themselves to the expense of buying perpetual calendars. The purpose of 
this article is to provide a mathematical method which is so simple that the 
entire calculation can be done mentally and which is as convenient as a per- 
petual calendar. In this article this mathematical method is applied to the 
Gregorian, Julian, and World calendars. Since a great many records have 
been made using the Julian and Gregorian calendars, the adoption of the World 
calendar would not completely eliminate the usefulness of applying the mathe- 
matical method to the historical calendars. The mathematical method also 
shows to what extent the World calendar is a simplification ; this is important 
because proposals to reform the present calendar are attracting world-wide 
attention. 

In the theory of numbers occurs the expression, 

ash mod p, (1) 

which is read a is congruent to h modulo p, and which means that the differ- 
ence of a and h is divisible by p. Since p in this article is always equal to 7, 
it is convenient to represent (1) by 

ash, (2) 

Assume m stands for any number which represents any monthday of any 
month. Assume w stands for any number which represents any day of the 
week. It is assumed that 7 stands for Sunday, 1 for Monday, 2 for Tuesday, 
etc. It is assumed that the constant c for any month is the value of m at the 
first Sunday in that month. Then (2) becomes 

w = m - c, (3) 

which enables us to find w if w is known provided the constant c is known for 
the month in question. Consequently, all we need to complete our theory is 
to discover a method of finding c for any possible month. 

First, there will be discussed rules for finding c for any month of the Gregorian 
calendar in 1935. An inspection of the calendar shows that c for December is 
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equal to 1. Since November has 30 days, we can find c for it by adding 2, 
which is congruent to 30, to the c for December. Since the number of days in 
September, October, and November is 91, which is congruent to zero, the c’s 
for September and December have the same value. In like manner, since c 
for September is 1, the c for June is 2, and the c for March is 3. We now have 
all the theory which is necessary to find w at any date in 1935. For example, 
suppose we wish to find w for April 17, and know that the c for December is 1. 
Then, by adding 2 we find that the c for March is 3. We are now in position 
easily to calculate that the c for April is 7. Applying (3) we find that w at 
April 17 is 3, which stands for Wednesday. 

All that is necessary to complete our theory of the Gregorian calendar is to 
find rules for finding c for December of any possible year, because, if this is 
known, we can find c for any month in that year by the method used for 1935. 
It is convenient to represent the expression, “c for December 1935” by “C for 
1935.” In like manner C for any calendar year means c for December of that 
year. Since C for 1935 is 1 and since the number of days in 1936 is 366, which 
is congruent to 2, subtracting this 2, we find that C for 1936 is 6, because —1 
is congruent to 6. Knowing C for 1936, we deduce that C for 1940, which is 
four years later, is 1, because 6 -f- 2 is congruent to 1; and that C for 1928 is 2, 
found by subtracting 4. The C’s for 1900, 1928, 1956, and 1984 are equal. 
Full centuries in order to be leap years must be divisible by 400. Since C 
for 1900 is 2, we find by adding 1 that C for 2000 is 3. Knowing C for 2000, 
we deduce by adding 2 that C for 2100 is 5 1600, 2000, and 2400 have the 

same value of C. If it is assumed that the length of the tropical year is exactly 
365.2425 days, we have all the theory which is necessary to find C for any 
possible year. Although this assumption contains a small error, any further 
discussion of it would hardly be of any practical interest. The foregoing theory 
provides complete methods for finding w by means of a series of steps, which 
are so simple that the entire calculation can be done mentally. For example,- 
suppose we wish to find w for November 29, 1888. Each of the C’s for 1800 
and 1884 is 7. Therefore, G for 1888 is 2, which is congruent to 7 -(- 2. Adding 
2, c for November of this year is 4. Applying (3), we find that w at November 
29, 1888 is 4, which stands for Thursday. In order to calculate mentally w 
for any date of the Gregorian calendar, it is only necessary for me to remember 
the foregoing mathematical method and to remember I was bom on November 
29, 1888, a Thanksgiving Day. 

Deplorable changes were made in the Julian calendar between 45 B.C. and 
1 A.D. Also it was not until 325 A.D. that the use of the 7-day week became 
general throughout the Roman Empire, gradually supplanting the old division 
of the month into Calends, Nones, and Ides. Therefore, in order to save space, 
the application of our theory prior to 1 A.D. is left to the reader. Starting 
irith this year it is only necessary to discover a rule for finding the &s of the 
Julian calendar for the full centuries, because the rules of the Gregorian calendar 
apply to all other years. October 5, 1682, Old Style was the same day as Oc- 
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tober 15, 1582, New Style; the Gregorian calendar was bom at this date. De- 
cember 17, 1600, New Style was a Sunday, and was the same day as December 7, 
1600, Old Style. Therefore, C for 1600, Old Style is 7. It is now a very simple 
matter to complete our theory of the Julian calendar. Since C for 1600 is 7, 
subtracting 1, C for 1500 is 6. 200, 900, and 1600 have the same value of C. 

In the case of the World calendar the c’s for the three months of each of the 
equal quarters can be found as follows For the first month c is 1. There- 
fore, c for the second month is 5, which is congruent to 1 — 3. Subtracting 2 
from this 5, we find that c for the third month is 3. 



NOTES 


ON STANDARD ERROR FOR THE LINE OF MUTUAL 
REGRESSION 

By Y. K. Wong 

1. In Pearson’s On Lines and Planes of Closest Fit to System of Points in Space, 
he establishes a formula for the mean square residual for the best fitting line in 
g-space: 

(1) (mean sq. residual)^ = al, -h ••• -f al^ - 

where IRma is the length of the maximum axis of the correlation ellipse in 
g-space, and A is the correlation determinant.* 

In the present paper, we consider a 2-dimensional case, and shall call the mean 
sq. residual as the standard error, denoted by Sk. 

In 2-dimensional space, a correlation ellipse is 

(2) ax^ 4- 2hxy -f 6?/* -j- c = 0, 
where 

(2a) a — (7y, h — a*, h ~ ~ Pxv ~ ~'Vyx) ^ 

Pearson gives in the 2-dimensional space the followmg formula for Sn'- 

(3) Sn ~ ff*ffv/semi-major axis of equation (2). 

Expression (3) can be readily deduced from (1). This paper aims to present 
some formulae for Sn, more convenient for practical computation, and also call 
attention to a misprint in Pearson’s paper. 


2. From analytic geometry, we see that the angle tp, between the major axis of 
the ellipse (2) and the x-axis is given by 

(4) tan 2«j = 2ft/ {a — h). 

By rotation of the axes, equation (1) can be written in the form 

(6) a'x^ -f b'y^ -}- c = 0, 


where 

(5a) 


a' = o-cosV — 2ft*8in^'C0S(p — ft'Sin^v’ > 0 

b' — o-sinV — 2ft<sin^-co8v - b-cosV > 0. 


‘Philosophical Magazine, 6th Series, II (November, 1901), p. 569. 
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Lemma 1. The value of a' given by (5a) is less than h'. 

To prove this lemma, we find from (4) and (6) 

a' _ b' = a + 5, a' — 6' = 2/i/sin 2(p = ~ 2piv/sin 2ip, 

and hence 

(6) 2a' = a + b ~ 2p:ty/sin 2^?, 25' = a + & + 22 Jw/ain 2<p. 

Since both a and b are positive, the lemma will be proved if we can show that 
p*i,/sin 2ip is a positive quantity. By (2a), p,,/ = in which cti, cr„ are 

positive; hence the sign of p depends upon the sign of r. If r*v < 0, then <p 

Stt * 

and 2fp is of such a nature that <2^ < 2ir. It follows sin 2^ < 0, and 

z 

hence p*v/sin 2<p is positive. On the other hand, if r,v > 0, then <p is such that 
0 < 2^ < it, and hence sin 2<p > 0. It follows that Piy/sin 2ip is positive inde- 
pendent of the sign of rxy. 

Lemma 2. The square of the mean square residual is equal to a', and hence 
Sf, = al cos'* <p — 2p,y sin ^ cos -f a\ sin** ip ~ — oD — ^ • 

For from (5), we obtain (semi-major axis)* = — c/o' = -b Substituting 

Of 

this into (3), we obtain Ss = o'. The balance of the lemma follows from (5a), 

(6) , and (2a), 

Lemma 3. For every riy, we have 

(7) sin 2^ = vJVK, K={al- o*)* -f 4p*„. 

For, from (4), we find sin 2 vj = —pt,/±-\/K = 

ment given in the demonstration of Lemma 1, we see that r,y and sin 2^ should 
be of the same sign. Hence the negative sign is chosen before the radical. 
From Lemma 2 and (7), we have the formula given by Pearson: 

(8) 251* = (o* + cir - vk. 

3. We are going to establish several more formulae for Sn. From (4), we 
have 21i-tan {p) = — (a — b) ± "v/iC. The sign before the radical is deter- 
mined in such a way that tan (ip) has the same sign as r*y. By the reasoning 
given in Lemma 1, the negative sign is chosen. Thus 

-2pxytan <p = -{al - al) - VK = c* + c* - \/k - 2 cJ 
or 

2((tJ — p^^ tan,p) = al — el -- s/K. 

This proves that 

(S) Sy = tf J tan v5. 
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Similarly, we have 

( 10 ) Si = ol ~ 

For computation, (9) and (10) are more convenient than (8). When the line 
of mutual regression is detennined, it is knov.m that tan (p (denoted by B) is 
equal to the slope of that line, and hence cot <p (= 1/B) is equal to the reciprocal 
of the slope. Then we can write (9) and (10) as follows: 

( 11 ) ^ <>1 - Pv»-B 

( 12 ) 

The second formula given in Ijemma 2 ia simpler than (8), but not as simple as 
(11) and (12), 

For computation, it is convenient to find <p from the equation 
tan 2<p « « H, 

(T,— (r„ 

i.e., 

2 vj =» arctan H. 

Since sin 2<p and are of the same sign, we can determine the value of ip from 
the preceding equation by insj)ection, though arctan H is & multiple-valued 
funodon. After the determination of <fi, we can obtain 

B K tan <p. 

Then we can compute Sh either from (9), (11), or (10), (12). 

There is a very interesting fact furnished by (11) and (12). These two 
formulae are, in fact, generalisations of the following two well known ones; 

(ft) si « <rv(l ”■ 0 

(b) Sl^aUl-r), 

where is the standard error of the line of regression when y ia used as dependent 
variable and x as independent variable, and similarly for 5*. It is clear that the 
line of mutual regression may be looked upon as a generalization of the other 
two lines of regression when wc, use y or x as dependent variable. So the slope 

B of the line of mutual regrewion ia a generalization of = r— and hxv =» r— • 

<r» 0 "^ 

where the subscript yx means yonx and zy, x on y. If we use x as independent 
variable, then we must obtain instead of B. Hence substituting the formula 
of bv« instead of B into (11), we obtain, after a simple reduction, the same result 
as given by (a). On the other hand, if we use y as independent variable, we 
must obtain instead of 1/B. It will result (b) when b,v is put in the place of 
1/5 in (12). The generalization perhaps can be seen more clearly if we write 
(a) and (b) into slightly different forms: 
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(a') Sj = 

(b') Si = al - 

4. The misprint in Pearson’s paper is on the second formula of the following; 

[uaRY =■ „^ = K'S - 

where tan 2<p = 2rxya^v/{ol — o\). cot^ip should read “square of semi-major 
axis of ellipse (2).’’ Professor Henry Schultz first noticed this misprint and 
suggested to the writer to investigate it. 

In a recent letter to Schultz, Pearson pointed out that one of the simplest 
formula for Sy or {MSRY is given by 

(a) S.v = al sin* <p + al cos* ip, 

where (p is defined by (4). However, Professor Schultz expressed doubt about 
its validity. From lemma 2, it is clear that («) is also not true. 

Institute of Social Sciences, 

Academia Sinica, Peiping 



the DISTRIBUTION LAWS OF THE DIFFERENCE AND QUOTIENT OF 
VARIABLES INDEPENDENTLY DISTRIBUTED IN PEARSON 
TYPE III LAWS' 

By Solomon Kut,LiucK 

Although the results herein described are not entirely n(*w, it is felt that the 
method of solution is of interest as presenting further illustrations of the afsplica- 
tion of characteristic functions to the distribution problem of statistics (/), 


1. Distribution law of the difference. Ix;t u ~ t ~ when* the di.stribu- 
tion laws of x and y are independent and given respiuitively by 

(1) ?/<?». 

Kp) I w 


The characteristic function of the dwtribution law of u is given by (/), 

(2) ^(0 = 


c'*"' dx I ” !/«■"' dy 


( 3 ) 


I'(p) 

1 


rb/) 


(1 - tt)>‘ (1 + U)" 
The distribution law of u is given by (i), 

1 r 


(4) 

Let 1 — il 

( 5 ) 


D(u) = 


j-„ (1 - ay (1 + <0" 


D(u) 


g-uyp-i r-«+.« 


c"' dz 




Now it may be shown that (1) 


( 6 ) 


In 


dz 


(- z)‘ 


(' + 0 


^i2u) 2 


W 


szs.i 

I 3 


p-s (2«) 


' Presented to the American Mathematical Society, June 20, 1034. 
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where Wi^, „(z) is the confluent hypergeometric function (2). Since PF't, ^(z) 
we have finally 


W = (2^)- 

“5“ 2 ’ 2 


2 Tip) 


For p = g, since Wo, m(2x) = K„ix) where K„ix) is the Bessel Function 

V r 

of the second kind and imaginary argument (f), we obtain 


(8) Diu) = iu). 

2 ' T(p)V^ 

This result has been otherwise obtained by Pearson, Stouffer, and David (S). 

2. Distribution law of the quotient. Let u = log x — log y where x and y 
are defined as above. 

The characteristic function of the distribution law of u is given by {1) 

(Q\ JA e-^ dx T® e-« dy 


Tjp + tt) Tjq - it) 
Tip) T{q) 


The distribution law of u is given by (1) 


= - r Tjp + it) Tjq -it) 

2’r J-« Tip) Tiq) 


Let q — it = — z, so that 


(12) Diu) = 


r(p) r(q) 2irf 


/ — g+ioo 


Tip + ? + a) r(— e) 


Now it may be shown that iS) 


/ — g+t« 

e-«' 

•g— \to 


so that 


Tip + q + z) r(— z) dz = Tip + q) il -f- e-u)-(P+a) ^ 


Diu) = ^(T+q) 

r(p) Tiq) (1 + e»)p+* 
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Since e“ = - = to, we obtain as the diKtrihiHiuii of fit- tjun!;* t,* 

y 


(14) 

If in (13) we set 


p(iv) 


I'U’ + V! »■’' ® 

r(p) rO/) fi i »■)*' 


we obtain 


(15) 


P - 2 ' 


-0(2) 


y 


m 


2 ' 


**5 » 

II I II j I ' 


r 


Kilt!’) 

/ niN /nt\ 

I 2 yi'\^2 y ‘1“ 


which result has been otherwise obtained by II. A Fi'-lnn" i J’* 
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lEPORT OF THE MEETING OF THE INSTITUTE OF 
MATHEMATICAL STATISTICS AT ST. LOUIS 

The Institute of Mathematical Statistics held a joint meeting with the 
Econometric Society and the American Mathematical Society at St. Louis, 
Missouri, on January 2, 1936. The program consisted of an invited address, 
^‘The Mathematical Theory of Index Numbers,” by Professor Thomas Rawles, 
and the following additional papers: 

(1) On Certain Distributions Derived from the Multinomial Distribution, 

by Dr. Solomon Kullback 

(2) Convexity Properties of Generalized Mean Value Functions, by Dr. 

Nilan Norris 

(3) The Frequency Distribution for the Mean of n Independent Chance 

Variables When Each Is Subject to the Law by Prof. 

W. D. Eaten 

(4) On the Admissibility of Time Series, by Prof. Francis Regan 

The Institute voted to hold a meeting at Cambridge, Massachusetts, early 
in September of this year. This meeting will be in connection with the celebra- 
tion of the Harvard Tercentenary. Professor R. A. Fisher will deliver an 
invited address before the Institute and the American Mathematical Society. 
A more detailed announcement of the meeting will be made later. 
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SHEPPARD’S CORRECTIONS FOR A DISCRETE VARIABLE 


Cecil C. Craig 


In the Annals of Mathematical Statistics,^ J. R, Abemethy gave a derivation 
of the corrections to eliminate the systematic errors in the moments of a discrete 
variable due to grouping. It is the purpose of this note to considerably shorten 
and simplify the derivation of these corrections by an adoption of a device used 
by R. A. Fisher (not published so far as I know) in the case of the ordinary 
Sheppard’s corrections. 

Let us suppose that m consecutive values of the discrete vanable in question 
are grouped in a frequency class of width h. The m smaller intervals of width 
k/m go to make up the class width k, the actual points representing the m 
values of the variable being plotted at the centers of the sub-intervals. Now 
let us suppose that each of m consecutive boundary points of the sub-intervals is 
as likely to be chosen as a boundary point of the larger intervals as any other. 
Then, if a;, is the class mark of the f-th frequency class, for any true value, x, of 
the discrete variable included in this frequency class, we have 


Xf — - X -|~ 6 

in which x and t are independent variables and e takes on the m values 


m - 1 


h/m, 


m — 3 


k/m, 


m 


'3 , ■ m 
— k/m, — ; 


1 


k/m, 


with the equal relative frequencies 1/m. 

The moments of a:, are those calculated from the grouped frequency distri- 
bution; the problem is to express the average values of the moments of x in 
terms of the calculated moments and k and m. The use of moment generating 
functions at once leads to the desired results. Denoting the s-th moment of x, 
about any origin by j'^, the like moment of a: by the respective moment 
generating functions of the two variables by and Mxii}) respectively, 
we have at once 

( 1 ) = 2 “> 

7th 

, / 


1 “On the Elimination of Systematic Errors Due to Grouping,” vol. IV (1933), pp 
263-277. 
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in which by definition 

= 1 + .-i'.? + «?V2! + *?V3! + • ■ , 

M.{^) = 1 + + M3 t?V3! + • ■ • . 

The computation necessary to get the actual corrections consists in the calcula- 
tion of the coefficients in the formal expansion of 


( 2 ) 


k/m 


MM = 2 


in— 1 , , 

k/m 


0*1? 


m 


in powers of ?? and then solving for the n',’sm (1). 

But the summation indicated in (2) is readily effected by means of the calculus 
of fiilite differences. In fact, we get 


(3) 




kO/m — kOh» 

e — e 


- 1 ) 


sinh M/2 
m ,sinh fct?/2m 


Then (2) becomes 

(4) 


= MM 


sinh M/2 
m sinh M/2m ' 


If we let m — > 00 we get the corresponding result for a continuous variable 


(5) MM = MM) 

already given by Langdon and Ore,'* though in a less elegant manner; for in this 
case, the expression analogous to (1) is immediately seen to be 

fk/2 

MM) = MM) / e‘^ «?6/fc . 

d-I/2 

Returning to (4), taking the logarithms of both sides, remembering that the 
logarithm of the moment generating function is the generating function of the 
semi-invariants of Thiele, we get, 


( 6 ) 


X3tyV3'-t- ••• 


= M -f Xj 1 ^ 72 ! + h t?V3! -f ... - log 


Mf2m sinh k&/2 
M/2 sinh M/2m ’ 


in which the X/s are the calculated semi-invariants and the Xr’s the corrected 
ones. 


“ W. H. Langdon and O. Ore, Semi-invarianta and Sheppard's Corrections, Annals of 
Mathematics, vol 31 (1930), pp 230-232. 
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But since 


X 


5, 


2s (2s) 


(2x)^» 


we have on setting: 




(8) 


02, = 


Oo = 0, 02 ,+! =0, s = 0, 1, 2, 


2s 




s — 1, 2, 3, • • • . 


Obviously these a’s are the “Sheppard’s” corrections for the semi-invariants. 
We have generally 

Xss+l = Xja+l, s = 0, 1 , 2, • • • . 

. B,k^‘ 


X2, — X2, -f- ( — 1)* • 


2s 


-A _ _lY 

\ m^V 


In particular 

X. . - (1 - ^)(.V12 


X, = x« - ( 1 - ^ ) fcV252 


X 4 — X 4 


(’-s) 


feVl20 Xs == Xs -f 


G-s) 


fcV240 . 


For m — > 00 , these give of course the results reached by Langdon and Ore.® 
To get the corrections for the moments let us set 


From (7) and (8) 


«o = 1, ajn+i = 0, n = 0, 1, 2j • • • 

(9) = V (2w) 1 0 / 2 ( 140^6 * * * 

■" ^ (2!)'(4!)'(6!)‘ ••• r!s!t! ••• 

the summation extending over all positive, integral values of r,s,t, ■ for which, 


r -f 2s -f 4- - • • = R. 


’ Log, oit. 
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Then finally we have the formula, 

I] 

(10) ■ = S 

8 5= 0 

for the corrected moments. 

Writing out the first four a’s, we have for the first eight moments about the 
mean 

Ml = n = 0 

= yj - (1 _ 1 /m') fcV12 

M3 = ^3 

_ (1 _ 1/m') r 2 fcV 2 + (1 - l/m')(7 - 3/m') fcV240 . 

M 6 = I's ~ 5(1 — ■ 

M 3 = n - 5(1 - l/m=) r 4 fcV 4 + (1 - l/m')(7 - 3/m') Pik*/16 

- (1 - l/m')(31 ~ 18/m' -f 3/m')k‘/1344 
_ 7(1 _ 1/m') 1 / 5^74 + 7(1 - l/m')(7 - 3/m') 1 - 3^748 

^8 = ,8 _ 7(1 _ l/m')r6fc'/3 + 7(1 - l/m')(7 - 3/m') i',k*/24 

- (1 - l/m')(31 - 18/m' + S/m*)) rs fcV48 

+ (1 - l/m')(381 - 239/m' + SS/m* - 5/m7fcV11520 

The final term in M 2 n as given above is a 2 „ . 

The above method is readily extended to the case of two or more variables. 
We will illustrate the procedure by getting the results likely to be required for 
two variables. As before we suppose that m consecutive values of x are grouped 
in a frequency class of width k, and we shall similarly suppose that n values of y 
are grouped in a frequency class of width 1. And arguing as before we write non 

Xi = X ^ 

y, ^ y + n 

111 which i and rj are independent of x and y and of each other. 

The moment generating function of two variables is defined by the identity 
in 1 ? and « ; 


n\ / 


w) — 1 7 (mi od 4" Mo ii>>) “b ^ (msoi^^ "b 2mi idoi -b mo 2 *^^) ~b ‘ ' 

= 1 + (mi od -b Mo 1“) + (mi od + Mo 1“)^*^ + ^ (mi oi? + Mo 1“)'^^ + ‘ < 

in which the manner of expaasion of (mi«i> + yh i^)'’'’ is evident. 
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Then from the properties of moment generating functions, we have 


( 11 ) 


7»— 1 


X./m 


iWl., to) = to) 2 


1 . 
^ 


n— 1 
2 


i/n 



mn 


,, /. s sinhifct?/2 sinh i!co/2 

— vTTTTo — * — vTt — 7o~" * 

m Sinn M/2m n smh Za>/27i 

As in the case of a single variable it will be simpler first to get the corrections 
for the semi-invariants. The logarithm of the moment generating function 
is the generating function of the semi-invariants ; thus 

log Mx,y{'^, to) = (Xio?^ -h Xioco) ^ (Xiot? -j- Xoi«)® ^ (Xioi? -h Xoiw)^®^ -f- • • , 
m which 


(Xiot? Xoito)^®' — Xgo*^^ “h SXji^^to -|- 3X]2t?co® Xosto®, 


etc. 

We write (see (7)), 
( 12 ) 


, m sinh M/2m , .i,., , 


with 




asr = k: (1 - l/w=’0 


52, = 


2r 

(-D’BxP’ 

2s 


(1 - l/n^’) . 


(13) 


Then from (11) we have 

(Xiot?- -j- Xoito)®*+*5 = (Xiot?^ Xoi<o)f*'+^5 , s = 0, 1, 2, 


(Xioi? + Xoico)»*) = (Xiot? + Xoito)(2*) -|_ -f 52.w=*, s = 1, 2, 3, • • • , 


in which, of course, X„ is a calculated semi-invariant and X,, a corrected one. 
We read off 

Xr, ~ Xra , TS 0 , 

as already shown by Wold in the case of continuous variables,'* 

Xjj+i.o = Xjj+i.o , Xo, 2 ,+i = Xo, 2,4.1. 


‘Herman Wold: Sheppard's Correction Formulae in Several Variables: Skandinavisk 
Aktuarietidskrift, vol. XVII (1934), pp. 248-265. 
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The values of Xas.o are the same as those for Xz, given above and. those for Xo, 2 , 
are obtained from these merely by replacing m them m and k by n and I And 
it is quite obvious that for any number of variables the only semi-invariants to 
be corrected are those in which a single figure of the index is different from 
zero and is moreover even For such semi-invariants the corrections are 
naturally those derived for a single variable. 

Now to derive the corrections for the moments, we write 


m binh fcd/2m _ asinh fa;/2n ^ ^,/ 2 I(«»W)+i/ 4 '«m 5 ‘W) + . 
smh M/2 sinh Zcu/2 


= 1 -f- 1/2! (a2od^ 4- ousioi^) 4- 1/4! (ccaod^ 4- 


with now, 


V 2 o-t-o!o 2 ; (2I)’' (4!)“ r! s! ■ ■ ■ 

the summation to be over all positive integral values of r, s, • • • for which 

T 4“ 2s -j- • • = il 

and m which the parameters d and w may be omitted without ambiguity. 
The formula for the corrected moments can now be written 


(14) 4- ^2g^ «o2)^«' (r(i) -f . 

This gives 

Ml 0 4 " Mo I = ri 0 4 " ro 1 
(mio 4 " Moi)® = (rio 4 ~ 4 ~ (“zo 4 “ “oz) 

(mio 4- Mol)® = ("lo 4- 4- 3 (i>Id 4- >'ot) (“20 4- 002 ) 

(mi 0 4- Mol)® = (rio 4 - I'o 1 )® 4- SCrJ 0 4- Voi)® (a 2 o -j- aoi) 4- (azo 4“ aoz)® 


Noting that, 

(o£ 2(| 4" Oios)® = O 4 4- 4" 3(02 4" 62 )^^ , 

we get the following formulas for the correction of the product moments about 
an arbitrary origin : 

Mu = f'li 

M2 1 = — (1 — 1/m®) Vox fc®/12 

mU = r (2 - (1 ~ 1/n®) f®/12 

Mu = rn — (1 ~ 1/m®) fc®/4 

M 22 = >'11 — (1 -- 1/m®) ^20 Z®/12 — (1 — 1/n®) ro 2 fc®/12 

- (1 - 1/m®) (1 - 

Ml 3 — >'1 3 — (1 — 1/n®) I'l 1 P/4 . 


1/n®) fc®Z®/144 
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The above results give the corrections for moments about the mean, merely 
by dropping the primes and setting vm = roi = 0 . In practice the corrections 
needed are for moments about the mean, and though there would be no difficulty 
in computing additional results for an arbitrary origin, I shall give here only the 
additional results for moments about the mean through the sixth order, omitting 
those obtained merely by permutation of subscripts and interchange of k and m 
with I and n respectively. 

First, the necessary extension of ( 15 ) is 

(mio “h “ (rio -(• -J- 10 (vio (0:20 -h 002) 

( 15 ) (rio -|- jLioi)®' = (rio + + 15 (no + roi)^'*^ (020 + *02) 

+ 15 (no + roi)*’*’ («2o + ao2)®^ + (0:20 + 002)® 

We need the additional relation: 

(«20 + Ofoo)® — Co + &G + 15(04 + hi) {0,2 + bi) + 15 (02 + b2)^ . 

The additional formulas for product moments about the mean follow; 

M 41 = "41 - (1 - 1/m-) fcV12 

Ai 82 = r 32 — (1 — 1 /n?) I'ao 1^/2 — (1 — l/m^) n2 k'^/i 

/i6i = ^51 — (1 — 1 /W) 5 ni fcVO + (1 — 1 /m^) (7 - 3 /m-) m k^/iS 

= ,,2 - (1 - 1/+) no r-/12 - (1 - l/m^) n2 fcV2 

+ (1 - l/m^) (1 - 1/n^) no kW/ 2 i 

+ (1 - l/m^) (7 - 3 /m 2 ) v, 2 ¥/ 2 i 0 - (1 - l/m^) (7 - S/m^) (1 - 1 /n') MV 2880 

W 3 = V33 — (1 — 1/m^) no k'^/i - (1 - 1/w'^) m IV 4 

+ (1 - l/m^) (1 - l/n=) m knym . 

For m and n infinite these results give the formulas for two continuous vari- 
ables already found by Baten® and Wold.® 

The reader will note that this development does not impose the “high contact" 
condition, except in so far as it assumes the existence of the moments that 
occur in the formulas. And it exhibits in the clearest fashion that Sheppard’s 
corrections are corrections on the average, 

Univebsity OP Michigan 

‘ W. 1 ). Baten: Corrections for the Moments of a Frequency Distribution in Two Vari- 
ables; Annals of Mathematical Statistics, vol. II (1931), pp. 309-319 
“ Loc. cit , p. 253. 



FimDAMENTALS OF THE THEORY OF INVERSE SAMPLINGi 
By Ching-Lai Shen 
Part I. Introduction^ 

Section I. Statistical Concepts of the Theory of Sampling 

One of the chief objects in statistics is to form a judgment of a very large 
statistical universe, known as a parent population, by means of a study of a part 
or sample thereof, which is drawn at random. To make a complete survey of 
the parent population is sometimes impossible or impractical. For example, 
it is impossible to measure the heights of all adult persons in a country. It is 
impractical to test for infectious bacteria the whole body of water in a city 
reservoir. All that we can do is to obtain an unbiased sample. By an unbiased 
sample, we mean a sample in which each individual has an equal and independent 
chance to be included. From this chosen sample we attempt to draw some con- 
clusion concerning the nature of the whole parent population in accordance with 
certain mathematical principles 

Now the sample which we choose is of course only one of the samples that can 
be possibly drawn from a given parent population. Suppose there is a popula- 
tion of s individuals from which we wish to choose a sample of r. It is clear 
that there exist jOr such samples, each of which is equally likely to be chosen. 
Therefore these ,Cr samples constitute the so-called distribution of samples. 
To describe from the statistical point of view the distribution of samples, we 
must find its mean, standard deviation, skewness, excess, and other higher 
characteristics. The first three are usually referred to as elementary statistical 
functions. 

Suppose a;, be the variate (by which we mean the magnitude of a specified 
character of an individual to be measured) where t = 1, 2, 3, • • • s; and z, be the 
samples chosen from the parent population where j = 1, 2, 3, • • • .C,. Then 
the ,Cr samples, each consisting of r variables, will be formed after the following 
fashion: 

Zl = a:i -b 22 -f *3 -f • • • -f Tr 
Zs — 22 -|- ils -1" ^4 "b ■ ’ • "b ^r+l 


— 2,_r+l "b 2,_r+2 "b 2j_r+3 4" • ■ • "b Tj 


‘ A dissertation submitted in partial fulfillment of the requirement for the degree of doc- 
tor of philosophy in the University of Michigan. 

® The writer wishes to express his appreciation for the assistance Professor H. C. Carver 
has given him in making this study. 
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If we denote the ?ith moment of the parent population about its mean by 


r (a:. -Jlf*) 
. = 1 


and the nth moment of the distribution of samples about its mean by 

(?) 

L (2; - M,y 


— 1 => 1 
^ — — — 


and if we then utilize the multinomial theorem, we may he able to express the 
sample moments in terms of the moments of the parent population:® 


( 1 ) 



= rM 

X 




ih.z 


< 

SP2,X 1 

2! J 



pas 

-3!^ 

f 

Ps 

S 

SftZtx 1 

3! J 



fii 2 

= 4!^ 


S/14 * 

4! 

+ 

Pi s‘ 
2! “ 

P6:s 

= 5.^ 


SfX^ X 

5! 

+ 

Pa Pa 

P6.S 

= 6!^ 

[p. 

Spa:x 

6! 

+ 

P 4 P 2 


3! 2! 
4! 2! 


Pi Pi s^fil , 

2! (3!)* 3! (2!)® ' 


where Pn is obtained from the sampling polynomial Pn{p) by writing p' as p, : 

>i(p) = P 

Pi{p) = p - p® 

/f>\ ^ P sCp) =: P — 3p® 4" 2p® 

]p,(p) = P - 7p® + 12p® - 6p^ 

Pb(p) = p - 15p® + 50p» - 60p" + 24p® 

PeCp) = p - 31p® 4- 180p® - 390p< 4- 360p5 - 120p®, etc. 

where 

— — i-)(r — 2) ■ • (r — ^ — l) 

s(s ~ l)(s — 2) • • ■ (s — i — 1) 


Carver, H C , Annals of Mathematical Statistics, Vol I, No. I, pp. 106-107i 
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Section II. Feequenctt Curve of the Distribution of Samples 

The frequency distribution of samples is usually less scattered than individual 
observations. In order to ascertain the manner of the distribution, we have 
access to the well-known Type A Curve of Charher 

( 3 ) Fit) = ct>it) - - 1 + • • • 

1 zll 

where 4>it) = e “ 


C3 = Oii 

Ci = on — 3 

Cs = as — lOaa 

Ce = oie — 15a!4 + 30 

C; = a? — 21a6 + lOas 

Cs = ag — 28a8 + 210a4 — 315, etc. 

This formula is a powerful tool for representing any frequency; but it is 
emphasized by more than one author® that the usefulness of such a series repre- 
sentation of a frequency distribution depends upon the rapidity of convergence, 
and the rapidity of convergence in turn depends upon the extent to which the 
function is a fair approximation for F{t). We shall not, however, discuss 
here the question of convergence. What we are interested in is to apply this 
series representation to the distribution of samples and see whether our numerical 
experimentation justifies the use of it. 


TABLE I 

Heights of 1000 Freshman Students 
(Original Measurements Made to Nearest O.ljn.) 


Class 

Frequency 

58 5-60.4 

2 

60 5-62.4 

13 

62.5-64.4 

76 

64.5-66.4 

167 

66.5-68 4 

335 

68.5-70.4 

264 

70.5-72.4 

106 

72.5-74 4 

29 

74 5-76 4 

7 

76.5-78 4 

1 


^ Camp, B. H., The Mathematical Part of Elementary Statistics, p. 226. 

‘ Rietz, H. L., Mathematical Statistics p. 62. 

Carver, H. C., Frequency Curves, Handbook of Mathematical Statistics, p. 116. 



FUNDAMENTALS OP THEORY OP INVERSE SAMPLING 


65 


First of all, therefore, we take for our numerical example the heights of 1000 
freshman students in the University of Michigan, as recorded in Table I, which 
are assumed to constitute our parent population. 

From the above data we compute the first 6 moments as follows : 


= 67.91 

6 279,068 
M3:>= 0.489,552 

= 132 685,214 
= 78 435,794 

= 4574 080,554 


<r. = 2 505,81 
a3x= 0.031,11 
Oix = 3 365 , 36 
as* = 0 793,92 
ag a = 18 47 6 , 43 


Now suppose from this parent population in which s = 1000, we wish to 
choose loooCioo samples, each consisting of 100 individuals. To characterize 
the distribution of these samples. We first make the following table : 


TABLE II 

Values of p, and P, for s = 1000, r = 100 


Pi = .1 

P2 = .009,909,909,91 

P3 = .000,973,117,406 

P4 = .000,094,676,417,6 

Pt, = .000,009,125,437,84 

^3= 000,000,871,272,959,5 

Fi = .1 

Pj = .090,090,090,09 

Pa = .072,216,505,082 

Pi = .041,739,980,994 

Ps = - .005,454,352,918 
Pe = - 065,789,272,230 
Pi = .008,058,351,516 

P 2 P 3 = .006,472,571,500 

P 2 P 4 = .003,764,792,358 

P^ = .000,715,593,194 

P^ = .005,195,978,741 


Substituting into formulae (1), we obtain the first six moments of the distri- 
bution of samples: 


Mx = 
ih.z = 

Msi* = 

Mi;z — 

MS:* — 

fii.z — 2,704 


6791 

565.621,622 
35 353,734 
958,720.852,854 
198,538.702,142 
514,780.791,465 


tr, = 23.782,8 
a3*= .002,628 

a4,„ = 2.996,679 
a6^= .026,093 

a,,x = 14.945,539 
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The coefficients of Charlier’s Type A Curve turn out to be very small and 
rapidly decreasing: 


I = .000,438 

^ = - .000,138 
4! 

5 = - .000,016 

51 


= - 000,006 

We therefore may be justified in considering this series representation of the 
sample distribution as converging rapidly to the normal curve. It may be 
interesting to note that even from a parent population which is very sjcew, the 
distribution of samples is nearly normal — as the following example will show: 

TABLE III 

Weights of 1000 Freshman Students 


(Original Measurements Made to Nearest Pound) 


Class 

Frequency 

85- 

1 

95- 

8 

105- 

45 

115- 

1 132 

125- 

232 

135- 

244 

145- 

161 

155- 

97 

165- 

50 

175- 

16 

185- 

7 

195- 

3 

205- 

4 


M, = 139.32 

jk.z = 296.8343 

/Ij,, = 3,230.802 

= 351,180.14 

Mm = 11,811,480.5 
M6i* = 886,585,271 


<T, = 17.228,87 
03*= 0.631,74 
04.= 3.985,67 
06:* = 7.780,71 
06:* = 33.898,36 
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M. = 

13 ,932 


fia . = 

26,741 828,829 

O', = 163.529 

M3'. = 

233,317 229,045 

“3 z — 05334 

M4z = 

2,144,736,851.477,805 

ai, = 2 9991 

M6.3 = 

62,008,368,279 121,883 

asz = 53024 

M6:z “ 

287,107,828,746,809 017 

ao., = 15 00633 


= 008,89 

= - 000,04 

I? = _ 000,03 
5l 

I = .000,03 


Indeed the distribution of samples, in general, is very nearly normal irre- 
spective of the law of distribution of the parent population. From the practical 
point of view, as Professor H. C. Carver has remarked, the parent population has 
little control over the shape of the distribution of the samples of r is fifty or 
greater and if S is at least ten times as large as r.® 

Now as a numerical illustration of the theory of sampling I may, for example, 
choose at random 100 weights from the parent population of 1000 weights of 
freshman students, as recorded in Table III, with the aim of ascertaining the 
probability that the mean of this sample exceeds 142 pounds. 

Since we define the mean of a sample simply as the average measurement of 
the r individuals m the sample, which in this case is 100, it therefore follows that 
the ordinary moments of the distribution of sample means differ from those of 
the distribution of samples in (1) only by a constant multiple of l/r* where /c is 
the order of the moments concerned, while the standardized moments remain 
unchanged. Therefore in this problem, we have the mean of the sample means 
equal to 139.32 and the standard deviation equal to 1.63629. The average 
weight, 142 pounds, may be expressed in standard units as 

z - M, 142 - 139 32 
‘■-Vr’- — 6352 9 


In accordance with (3), the probability that the mean of the sample exceeds 
142 pounds is therefore equal to 



<^(0 - + 


dt 


^ CuryeTj C,f Annals of Mathematical StatisticSjYol I, No I, p. 112. 
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If -we take the first terra only 


If we take the first two terras 
05218. 

If we take the first three terms, 


F - 1 


(l>{t)dt — .05062. 

s 

^it)dt - .00889 


P = 


cj,{i)dt - .00889<f.<«(0 


Jl 63885 


+ (-.000,04),^®(0 


Jl.63885 


= .052182. 


1.63885 


Section III. Pearsonian Types of Curves 

Charlier’s Type A Series is, however, not the only known analsAic representa- 
tion of a frequency distribution. There are Pearsonian Types of Curves, the 
characteristics of which I shall need to summarize briefly. These Pearsonian 
Types of Curves are essential to the later development of our theory. 

The curves, suggested by certain geometrical properties of unimodal frequency 
distribution, are all obtained from the solution of the differential equation: 

\dy __ a — t 
ylt " JW 


where /(<) is assumed to be possibly expanded into a convergent power series, 
that is, fit) = bo + hit -b bjt^ . When the first three terms of the power 
series are taken, the differential equation immediately takes the form of 

= r — r £", — 7 - r The parameters, a, h, hi, h%, may be expressed in 

y (it Oo "p Oic -p 021^ 

terms of moments;'’ 


a - b— 

2(1 + 25) " 2(1 -f 25) 

b — - h — ^ 

' 2(1 + 25) ^ 2(1 -t- 25) 

where 

^ _ 2a4 — Sag — 6 

«4 -b 3 

Based upon the difference in the nature of the roots of the equation 
bo -b bii -b hii^ = 0, there have been derived thirteen types or curves. Of the 
particularly noteworthy ones, the normal curve and Type III may be men- 
tioned. The criterion for the normal curve is 0:3 = 5 = 0; that for Type III is 


’’ Carver, H. C , frequency Curves, Handbook of Mathematical Statistics, p. 104. 
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5 = 0 and as 7^ 0, In order to fix the form in a particular case, we may refer 
to Pearson’s Chart ^i/32 Distribution® where 

-2 

o _ Ma 2 a 

Pi — — «3 > P2 — = 04 , 

P2 M2 

and 

jr^ _ ^1 _ 0l(&2 “b d)® _ a1 

46o 62 4(4/32 - 3/3i) (2/32 - 3/3t - 6) “ 46(2 + 5)’ 

or to Elderton’s Frequency Curves and Correlation.'-^ 

Section IV The Inverse Sampling, Our Problem 

It is now our problem to study the theory of inverse sampling, by which wo 
mean that given the characteristics of a single sample drawn at random from a 
parent population, we wish to ascertain the probability that the corresponding 
characteristics of that parent population do not differ from those observed in 
the sample by more than a specified amount. To illustrate, suppose we are 
interested m knowing the average height of 1000 freshman students to which 
reference has already been made. Due to the fact that it takes too much time 
or is otherwise impractical to measure all of them so as to obtain the true average, 
we select at random one hundred of them and measure the heights of these one 
hundred individuals Suppose the mean, the standard deviation, and the 
skeivness of this sample of one hundred are computed and they are as follows: 

M = 67 99 

<7 = 2.327 

aa = - . 12299 

Now assuming that the true mean of the entire 1000 heights is unknowm, let 
us find the probability that the trye mean of this parent population lies between 
Mx = a and Mx = 6 by what we know of the characteristics of the observed 
sample of one hundred as recorded above. It is clear that if we can obtain an 
equation, y = fiMx), of the frequency curve associated w'ith the distribution of 
hypothetical means of this, parent population, we shall be able to ascertain the 
probability we desire by evaluating the following integral expression: 



® Pearson, K , Tables for Slaiishcians and Biometricians, Vol. II, front page. 
“ Elderton, W. P., Frequency Curves and Correlation, Table VI, opposite p 46. 
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In the same way we can find the probability that the standard deviation of 
the parent population lies between two definite limits or that the skewness of 
the parent population lies between two definite limits. 

Our procedure will therefore be as follows; First, assuming the a priori 
existence of a continuous sequence of hypothetical means of the parent popula- 
tion, we investigate the relation between the distribution of these hypothetical 
means of the parent population and the distribution of sample means. If such 
a relation exists, we shall be able to find an expression for the most probable 
value of the parent mean. Assuming the most probable value of the parent 
mean to be the true mean of the parent population, we shall obtain an expres- 
sion for the most probable value of the standard deviation of the parent popula- 
tion. Then it will be possible for us to express the frequency curve associated 
with the distribution of hypothetical means of the parent population in the form 
of fiMx). Similarly we may find the frequency functions associated with the 
standard deviation and skewness of the parent population. 

Before leaving this section, it is perhaps not out of place to say a word about 
the connection of this theory of inverse sampling with Bayes’s Theorem. The 
theory of inverse sampling (which deals essentially with the problem of judging 
the nature of a whole by observation of a part of it) belongs to the domain of 
inductive probability, or inverse probability, upon which Bayes’s Theorem was 
founded. In order to solve a problem of inductive probability, it is necessary 
to postulate the a priori existence of the causes from which an event takes place, 
which, in our case, is the hypothetical means of the parent population. 

This a priori hypothesis which gives rise to Bayes’s Theorem has been viewed 
with suspicion by a number' of mathematical statisticians. For example, the 
theorem has been called into question by such mathematicians as Bing, Venn, 
Chrystal, and others, including several now living. But so far as the present 
writer is aware, no definite conclusion has been reached. It is true that on the 
one hand Bayes’s Theorem has not been rigidly demonstrated and proved by 
logic; but on the other hand the process of generalization from observational 
data is justified within the limits of ordinary practical application. One who 
holds Bayes’s Theorem strongly may even say that the a priori hypothesis is 
absolutely necessary to scientific inferences. Concerning this controversy, 
Pearson takes a liberal point of view: “I hold this theorem [Bayes’s Theorem] 
not as rigidly demonstrated, but I think with Edgeworth that the hypothesis 
of the equal distribution of ignorance is within the limits of practical life justified 
by experience of statistical ratios, which o priori are unkno'wn • • He has 

further remarked that “the practical man • • • will accept the results of inverse 
probability of Bayes-Laplace brand till better are forthcoming.’’^ Using 


Pearson, K,, On the Influence of Past Experience on Future Expectation, Philosophical 
Magazine, Vol, 13, Jan.-June, 1907, p. 366. 

“ Pearson, K., The Fundamental Problem of Practical Statistics, Biomeirika, Vol. 13, 

1920-21, p. 3. 1 . . 
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Pearson’s viewpoint, we shall proceed with our problem by postulating a •priori 
the existence of hypothetical means of the parent population from which our 
sample is drawn 


Part 11. Fundamental Relation between the Moments of the Distribution of 
Sampling Means and the Moments of the Distribution of the Hypothet- 
ical Means Associated with the Parent Population 

The characteristics of the distribution of sample means, as we have pointed out 
in Part I, Section II, differ from those of the sample distribution only by a 
constant multiple of {l/rY where h is the order of the moments concerned. We 
may write down the first six moments of the distribution of sample means: 



From these we immediately obtain 

= Mi 

= -j/ r(s -’l) 

s — 2r / s — 1 
(5) j ~ s - 2 y r(s - r) 

(s - 1) (s'* 4- s - 6rs + 6r2) , 
^ ^ r{,-r){s~2){s~Z) 

6s(r — 1) (s — r — 1) 

" r{s - r) (s - 2) (s - 3)’ 
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If our parent population is infinite, which is a special case by allowing s — > = 0 , 
then we have 


(6) 


= M. 

1 

— 7= ^3 a; 

V r 

1 

= — p “S,» 

Vr 


— 



etc 


Let us now define /(i) as a frequency function of the distribution of sample 
means 2 * in standard units, i.e,, 


( 7 ) 




Denoting the observed mean of a given sample by mi and making proper sub- 
stitutions of (5), we obtain 


( 8 ) 


Wi — Mx 


mi — Mx 

V r(s - 1) 


It is clear that if wc hold s, r and a-^ constant and let M * vary, then t is a 
function of Mx only and consequently /(t) becomes a function of Mx- 
Suppose now M^x \ M''x \ • • • be a continuous sequence of hypothetical 
means, which M» has an equal chance to assume. These hypothetical means 
will certainly lie in a linear interval between their natural limits. Then the 
probability that Mx lies between M j =fc | dJkf * jsf{t)dMx ■ Therefore, to obtain' 
the probability that Mx lies in the interval Mi’* g M* g it is only 

necessary to carry out the integration of this expression : 

( 9 ) / , , fit) dMx 


There is no question as regards the existence of this integral in case of an 
infinite parent population. As for a finite population, we may still use this 
continuous function as an interpolation function to the true discontinuous 
function. 

Let us now define P(0 as the probability function for which the hypothetical 
mean of the parent population falls within certain specified limits. Considering 
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lin- p as the nth moment of this probability function about a fixed point, we will 
have the following relation: 


( 10 ) 


Mn j) — 


t : 




where I and — I are their natural limits. 

Since from (8), Mi = mi — a-^J, then after .substitution, we obtain 

Z 




(mi - (r,j)’'f(t)dt 


Mn p 


1 

L,-i 


(11) 


m^+ I 


I 


(nil — f(t)dt 


( 12 ) 


= ml - Tj j ml ^ + f 2) ““ “ w zi - u) h.zz 


+ + (- 1 )" 


Mn Zx 


^l:p — 
fii.p = mf 

Map = ■^l “h 3miM2 Zx — M3 mi 

M4.P = mj + Qmlfiixx — 4mi;i3.zj, + pn.zx, etc. 


The first relation Mp = mi is important because it shows that the mean of 
the hypothetical means of the parent population is equal to the mean of the 
observed sample drawn from it. To state this in a theorem, we will have 
Theorem I. The expected value of a parent mean is equal to the mean of an 
observed sample chosen from the parent population. 

We now wish to express the moments of the probability function about its 
mean in terms of the moments of sample distribution. In general, the n.th 
moment of any frequency distribution about its mean, m„, can be expressed in 
terms of its moments about a fixed point after the following fashion: 


Mn 


- Mm«-i.+ Q +(-!)" M". 
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Therefore whea we substitute (11) into (13) we obtain 

= mt - mr‘ mu* + ^2) ' 

+ (-1)"~’ _ 3 ^ ml Pn~3.,^ 4- (-I)" “ _ 2 ) 

+ (-1)”-^ mi + (-!)" (^2) 


mi pn-2 1 


M2- zx 


mi fin~2\ 


4 - 

M2 i* 


- mi ml ^ m" fiu,^ + (^ 2 

+ mj Q mr* “ (^ T ^) ^ 1 '** + (” 2 ^) 

+ (-l)’'-a ^ mi iin-3i** + (-1)"'-* _ 2 ^ Mn-2 4 * 

- m? [mr^ - (” ~ ^) mr* fe** + (^ 2 ^) ^2.,* - 

+ . . . 

+ (-«■-' ’»!■'' („!i)[’»--(0«- 
+ (-«.»?(;) 

Adding vertically each column, we obtain 
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+ (- 




+ ( — 1)“ Mn:*i 

The first row of the above expression is equal to m"(l — 1)" = 0; the second 
row is equal to 


_„_i _ f n! n! -ra! (n — 1)! 

LoT^! ■ 1! (n - 1)1 “ 11 (» - 1)! ■ 1! (n - 2)1 


+ 


nl 


(n — 2)1 
2\(n - 2)1 11(71 - 3)! 


_ + (_ 1 )» 


nl 


(n- 1)!1!_ 


«![ 1 1 

mi 


+ 


2!(n-3)! 


+ (- 1 )" 


1 

(n- 1)!0!. 


^n-1 : 


■m'i ‘M1*x Yi 1-^ “ n-lCl + n-lC^ -•••-{- ( — 1)" „_iC„_i] 


= - 1)"-' - 0; 

the third row is equal to 
nl n! 


mi ’“jSs:** 


nl 


(n - 1) ! 


0!n! 2!(n-2)! l!(n-l)! 2!(n-3)! 

n! (n — 2)! 


+ 


+ (-ir 


n! 


!2!] 


_ ™n— 2r: 2i(n 1) ri ri 1 n 

— nil M2:»x — n-2^1 'T n— iL-z 


2! 


2!(n-2)! 2!(n-4)! ' ' (n-2)!2! 

.. 4- (-!)"-> 


= ml 




and similarly all the other rows turn out to be zero except the last one which is 
equal to (-l)’‘p„:,. 


(14) 


An-p — ( 
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This may be rewritten as 

1 fhn,p “ 

. 

M2n+l-p = — 

or in standard units 

I OC2n:p “ ^^riizx 

CtU^lxp = — «2n+l:2* 

The results!^ of (15) are important and fundamental because they establish 
the relation between the Theory of Inverse Sampling and the Theory of Sam- 
pling. Therefore we may formulate the following theorems: 

Theorem II. The even moments of the distribution of the hypothetical means 
of a parent population about its mean are equal to the corresponding even mo- 
ments of the distribution of the sample means about the mean. 

Theorem III. The odd moments of the distribution of the hypothetical 
means of a parent population about its mean are equal to the negative of the 
corresponding odd moments of the distribution of the sample means about the 
mean. 

Since the even moments of the two distributions are the same, while the odd 
moments differ only in sign, it is evident that for symmetrical distributions, the 
two curves f{t) and P(t) are exactly identical, because in a symmetrical distribu- 
tion all the odd moments about the mean are bound to vanish. In case of 
nonsymmetrical distributions, the curve P(t) is nothing but a vertical reflection 
of the curve f(t) as shown in the figure : 



In other words, if /(<), for instance, assumes Pearson’s Type III Function, then 
P(i) also assumes Pearson’s Type III Function except that their skewness is 
different in sign though equal numerically. We therefore state our theorem as 
follows: 


n So far as the writer is aware, these theorems were first developed by Professor H. C. 
Carver. 
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Theorem IV . The curves for the distribution of the hypothetical means of the 
parent population and the curve for the distribution of the means of the sample 
obtained from the parent population are symmetrically situated and one is a 
vertical reflection of the other. 

Part in. Inverse Sampling Associated 'with a Normal Parent Population 

We shall be concerned in this part of our discussion with a normal parent 
population. In aecorda,nce with the characteristics of a normal parent popula- 
tion we wish to investigate the most probable values of its mean and variance, 
thereby obtaining the distributions of the hypothetical means and variances of 
the parent population. 

Section I. Most Probable Value of the Mean of the Parent Population 

In Part I, Section III, we have mentioned Pearsonian Types of Frequency 
Curves whose differential equation is 

Idy _ a — i 
t dt hf^ -|- -|- 2)2 

It is clear that the mode of these curves is at 2 = a, provided the mode exists. 
But to recapitulate: 


_ ““3 

2(1 -I- 25)’ 

where 


S 


2a< — Sttj — 6 . 

«4 + 3 


consequently for the mode of the distribution of sample means, we have 


(16) 

where 


2(1 -H ’ 


(17) 




2«4;r. - 3a 8 - 6 


2(s - l)(s - 2)(s* + s-Qrs + 6 ri)(a 4 :* - 3) 

— 12s(r — l)(s — 2)(s — r — 1) — 3(s — l)(s — 3)(g — 2 r)°a 3 , 

(s _ 2) {(s - l)(s* + s - 6 rs -b 6 r*)(« 4 ;* - 3) 

— 6 s(r — l)(s — r — 1) 4 - 6 r(s — r){s — 2)(s — 3)} 


2r 


(18) 


Zx — M 




Zx — — X 




- 1 

r(s — r) 


as:* 


r(s - 


2(1-1- 25,j,) 


1 ) 
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Now according to Theorem IV, the mode of the probability function P(t) is 
situated symmetrically with respect to the mode of the frequency function f{t) 
of the distribution of sample means ; hence, for the mode of the probability func- 
tion of hypothetical means of the parent population, we have 


(19) 


— Mx 
/ s ~ r 
y r(s - 1) 


s — 2r / s — 1 ' 
s — 2 y r(s — r) * 


2(1 -f- 25,,.) 


where di^ remains unchanged because it is a function of and each being 

always positive. 

Solving for M, , which will now be the most probable value of the mean of the 
parent population and hence denoted by J0'*, we have 


( 20 ) 


M, = wii — 


s — 2r ffias X 
r{s ~ 2) 2(1 + 26J 


It is interesting to note that if s = 2r, this expression yields M* = mi , irrespec- 
tive of the law of distribution of the parent population provided only that 5,, 
is not exactly equal to — ■!. But since the Pearson’s function is used for gradua- 
tion, one should not fail to see that the mode so obtained gives only an approxi- 
mation to the true mode. Therefore we state a theorem as follows: 

Theorem V. If a sample is composed of one-haK of the variates of the parent 
population from which the sample is chosen, then the best approximated ‘most 
probable value’ of the mean of the parent population is equal to the mean of the 
observed sample provided only that is not exactly equal to — §. 

It is further observed that if as;, = 0 but 5,,, , then the expression (20) 

will likewise yield Mx ~ mi. But as., = 0 implies that the frequency curve of 
the parent population is symmetrical. Hence 

Theorem VI. For any symmetrical curves associated with the distribution of 
the parent population, the best approximated ‘most probable value’ of the mean 
of the parent population is equal to the mean of the observed sample provided 
6,,, is not exactly equal to — 

But we will investigate further the most probable value of the mean of a 
normal parent population, and we know that in a normal distribution the 
moments .bear the following relation:** 


(21) 


(2n)! 

«2» == KT j 

2" n\ 

as„4-i = 0 


Carver, H C., Frequency Curves, Handbook of Mathematical Statishcs, p. 97. 
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i.e., 0 !g = 0 

a4 = 3 

ag = 0 

ttg = 15 

a? = 0 

as = 105 


etc. 

Consequently for a normal parent population the function in (17) is 
immediately reduced to 


( 22 ) 


, _ 2s(r — l)(s — — 1) 

s(r — l)(s — r — 1) — r(s — r)(s — 2)(s — 3) 


Let us, first of all, investigate the possibility that this expression will be 
exactly equal to — ^ for positive integral values of r and s. 

Suppose we set 


2s()' — l)(s — r ~ 1) _ 1 

s{r — l)(s — r — 1) — r(s — r)(s — 2)(s — 3) 2 

and solve r in terms of s. Thus we obtain 


/'oo^ _ ® r — 10s + 6)* + 20s (s — l)(s® — 10s + 6) 

______ 

If s ^ 10, then the second term on the right side is positive. As it is absurd 
that r should be greater than s, therefore the positive sign of the double sign 



right member will be negative. Since r cannot be negative, no positive integral 
values of r and s, for which s ^ r, can satisfy (23). For s < 10, there are only 
nine positive integers; and direct substitution of each will tell us that only when 
s = 1, 2, or 3, r is a positive integer which is either 1 or 2. As these are trifle 
cases because a parent population can never be so small, we may safely say that 
for a normal parent population 

(24) iifi = mi 


Theorem VII. For a normal parent population, the best approximated ‘most 
probable value’ of the mean of the parent population is equal to the mean of the 
observed sample from it. 

For an infinite parent population, i e., s — > <» (20) yields on reduction 


(26) 


Ttjr I 0-3 X 

Af* = mi 


?' 2(1 -}- 251 ^) 


6 




- . gLr [(from 17)' 
(of4:x - 3) - 6r 


where 
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Formula (25) yields immediately M* = mi if as. j = 0 and Sz^ ~ i. For a 
normal parent population 5^^ = 0. Hence Theorem VI and Theorem VII both 
hold for the infinite case. 

Section II. Most Probable Value op the Standard Deviation op the 

Parent Population 

To find the most probable value ">f the standard deviation of the parent popu- 
lation, we shall assume the mean of the parent population to be the best ap- 
proximated ‘most probable value’ of the mean, which we have obtained in the 
preceding section. This assumption is necessary since we do not know the true 
mean of the parent population. 

Now, to start with, we shall consider ,Cr possible samples, each consisting of 
r variables. The second moment of each sample computed about the best 
approximated ‘most probable value’ of the mean of the parent population may 
be written as 

^ ((ji - mi)’ -f (*2 - mO’ + (xs - mY -f • • • -1- (a:r ~ Wi)’} 


~{ix2 - mi)’ + (aja 
r 


miY + (*4 - miY 4 . . . . + («,+! - mi)’} 


Z(.) = ^ {(x.-r+i - mi)’ 4 (a;,_r+2 - mi)’ 


4 (x._,+3 - mi)’ -1- ... 4 (as, _ mi)’} 


If we write (x; — mi)’ = j/i, it is clear that the above may be considered as a 
distribution of sample means drawn from a parent population yi, Vi, ys y,; 
and consequently 


( 26 ) 


/ s — r 
___ 


! _s-2r,/s — 1 

“ 8 - 2 y 


— 3 = 


(s — 1) (s’ 4 s — 6rs 4 6r’) , 
r(s — r) s — 2) (s — 3) ~ 


6s(r — 1) (s — r — 1) 
t{s -- r)(s — 2) (s — 3)' 
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Now the nth moment of y about -a fixed point may be written as 

JUniv = = Wj)^" 

^ ^12 {(!« - M,) + (M, ~ 

(27) " i^in.x ^ ^ ^ ASn— 1 X (,Mx — “f^lj 

+ (^ 2 ^ hn-i-x (.Mx - my + ^ /i2n-3.* (M, - mi)^ 

+ mn-i-z (Mx - mY + • • • + (Mx ~ my". 


On the assumption that our parent population is normally distributed and 
due to the fact that in a normally distributed function 


« 2 „ = 


(2n)! 

2’‘'n.! 


a^d a:2„+i = 0 


[See (21)], 


the expression (27) immediately takes this form: 

2n! 2 „ , /2n\ (2n - 2)! _ ni _ 2 n -2 

5^!'- 


l^n.v 


(28) 


+ Cr) ■'5 “" + ■ • ■ + w- “ 


Imposing the condition mentioned at the beginning of this section (i.e., M, 
assumes its best approximated 'most probable value’ mi), then all the terms drop 
out except the first one. Hence, as a final form, we have 


(29) 


f*n:v ~ 


Sti! 2 n 
2^1 


Hl;y — My — tf x 
l^'.y = 
tiZ.y = 15ff® 
m;y — lOSflT® 

etc. 


( 30 ) 
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It follows that the ifcth moment of y about its mean will be 

- (i) Ml 

_ 2fc! 2k (2fc — 2)! 2k I /^\ ~ 4)! 21 . 

\1/ 2*"i ()b - 1)! ^ V2/2*'-2 (it _ 2)! 


„[■ 2fc! /k\ (2k-2)\ (k\ {2k -i) 

(31) “ L 2 * • k\ \lj 2‘-‘ (A: - 1)! ^ W 2'‘-» (Jb ~ 2)! 

+ (- 1 )*! 


2fc 2A!! n fc k{k — 1) 

2* • A:! L 1! (2A: - 1) 2! (21^ - 1) {2k - 3) 

k{k -l){k- 2) 

3! {2k ~ 1) {2k - 3) {2k - 5) 

4 . ... 4 . (_ 1 U 

^ ^ A;!(2A:- 1)(2^ - 3)(2A: - 5) ••• (3) • (1) 


(32) 


And therefore we obtain 


fii.V = 0 

V — 2ffj 

hv, — 8<r® 

< M4:v = 60(T® 

M5:„ = 544crr 
ik-y = 6040 
etc. 


(33) 


oiziy = 2 - v /2 

a<,„ = 15 

" «6:v = 68 \/2 

as.], = 715 
etc. 
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Making proper substitution of (30), (32), (33) into (26), we obtain 


(34) 


Mzy = (tI 

_ ^ /2(s - r) 2 
y r (s - 1) " 

_ 2 (s - 2r) /2 (s - 1) 

s — 2 ^ r{s ~ r) 


12 (s — 1) (s^ + s — &rs + 6r^) — 6s (r — 1) (s — r — 1) 
r(s — r) (s — 2) (s — 3) 


For an infinite parent population, i.e , s — ^ <», we have 



Now again with reference to Pearsonian Types of Curves for which the mode 
is at < = a, we have for the mode of the distribution of sample means Zy , 


(36) 


(37) 




M; 


* 1 / 


JZy 


^3 Zy 

W+^) 


where 


20=4 zy - 3«1 - 6 

OiiZy 3 

2 (s — 3) [4(s — 2?-)^(s — 1) + 2r(s — 2)^(s — r)] 

(s — 2)[2(s — 1)(5^ + s — 6rs + 6r^) 

_|_ r(s — r)(s — 2)(s — 3) — s{r — l)(s — r — 1) 


Substituting (34) into (36), we obtain 


(38) 


f/ 


2(s — r) 
r{s — 1) 


s - 2r / 2(s - 1) _ 1 

s — 2 y r(s — r) 1 + 2Szy 


By Theorem IV, the best approximated ‘most probable value’ of the standard 
deviation of the parent population is obtained from (38) by changing the sign 
of the right member and replacing Zy by trh. Thus we have 





2(s — r) 
r(s — 1) 


s — 2r .'/ 2(s — 1) 1 

s — 2 y r(s — r) 1 + 2Sj„ 
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Solving for a-*, which is now the best approximated ‘most probable value’ and 
should therefore be denoted by a*, we then have 


(39) 


al = 




mz 

r7 ~ 2(s - 2r) 

' + r(s - 2)(1 + 25 J 


The best approximate ‘most probable value’ of the standard deviation may 
therefore be written doTO as 


1 




1 + 


2(s — r) 


where a. 


V' 


m2 


ris - 2)(1 + 25.„) 


This formula is, of course, subject to a systematic error that arises from the fact 
that we employ the square root of the best estimated ‘most probable valufe’ of 
the variance. Itmay be shown, however, that when r is large, the error is small.^"' 
Consequently, we have the following theorem ; 

Theorem VIII. For a normal parent population, the best approximated 
'most probable value’ of the standard deviation of the parent population is 
equal to 


i/ 


1 + 


2(s - 2r) 


r(s — 2)(1 2&ty) 


where o-, is the standard deviation of an observed sample from the parent popu- 
lation ando-,„ is a function of r and s as expressed in (37). 

It is interesting to note from (39) that when s = 2r, = v, provided 62,, 5^ - 1 

However, from (37), Si^ cannot be equal to -§ in the case of s = 2r, where s 
and r are both positive integers. Consequently, we may state this fact in 
another theorem; 

Theorem IX. If a sample is composed of exactly half of the variates of a 
normal parent population, then the best approximated ‘most probable value’ 
of the standard deviation of that parent population is equal to the standard 
deviation of an observed sample from it. 

For an infinite parent population, (39) yields on reduction 


(40) 




for (Tij = 0 when s 

T ^ Z 


Professor H. C. Carver has worked out a relation between the most probable value of 
and that of x by assuming that the latter is distributed according to a Type III distribu- 
tion. With his permission, I state the result as follows : 


where X 


~ and Mt 

(Tj 


M. P V. = 


(M P. V x)” 



the distance of the mean from the origin 
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Theorem X. For an infinite normal parent population, the best approximated 
‘most probable value’ of the standai’d deviation of the parent population is 

equal to the standard deviation of an observed sample multiplied by 


r + 2' 


Section III. Distribution of the Hypothetical Means op the Parent 

Population 

In the preceding two sections, we have obtained the best approximated ‘most 
probable value’ of the mean and the best approximated ‘most probable value’ 
of the standard deviation of a parent population assumed to be normal. We 
are now in the position to characterize the distribution of these hypothetical 
means by assuming that the best approximated ‘most probable value’ of the 
mean of the parent population be its mean and the best approximated ‘most 
probable value’ of the standard dexnation of the parent population be its stand- 
ard deviation Such a characterization is subject to its own probable error. 

Due to the fact that our parent population is normal by assumption, formulae 
(4), which we are to use this time, have to be modified by the proper substitution 
of the recursion relation of the moments of a normal distribution [See (21)]. 
After such modifications, they assume the following forms: 

'M., = M* 

s p - 

M2 21 — *2 "2 M2.2; 


( 41 ) 


M8.2* = 0 

Ml 2. = ^, + 


Ms t% ' 0 

15s 


** “ (^6-1- 3P4P2S -b Pls')jil.x 

In accordance with Theorems II and III, we therefore have for the distribution 
of the means of the parent population the following: 
f Mux - wii 

- - ^D- 


(42) 


M 3 Mx 


-M3.23, = 0 


u, Mx = ^ (D4 -h Ph)fiL = ^iPi + PUr^ 


2.x 


M 6 ;A/* 


tiiiMx — Me 2* 


15s 


-M6.2a, = 0 

15s 


(Pj -f- 3P4P2S -b Pls^)!i.l.x 


= ^ (P, -b 3P4P.S -b PWhlx 
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Consequeatly 


(43) 


= mi 

O-Af* 
otzut = 0 
OliU, 




1 ) 


3 = - 


(44) 


68(r — l)(s — r — 1) 
r(s — r)(s - 2) (s — 3) 

For an infinite parent population, i.e., s -+ » , we Kave 

{ Mm^ = mi 

w 


ffM, = 4/ = "7=^^, [froia (40)] 

Vr Vr V r + 2 Vr + 2 


<XZ‘Mx — 0 

— 3 = 0 


Now if we can find the equation of the curve associated with the distribution 
of the means of the parent population, we shall be able to ascertain the prob- 
ability that a mean lies within certain limits after a sample from the parent 
population has once been observed. 

Let us illustrate this by again referring to the same problem of the heights of 
1000 freshman students as recorded in Table I. Considering this as our parent 
population which is almost normal with s = 1000, we take every tenth indi- 
vidual height from the original list in which the 1000 heights are tabulated. 
Thus we obtain a sample with r = 100. The frequency distribution of these 100 
individual heights is shown in Table IV. 


TABLE IV 


Sample of 100 Heights Selected from the Parent Population of 1000 from Table I 


Class 

Frequency 

62.5-64.4 

9 

64. 5-66 . 4 

16 

66.5-68.4 

31 

68.5-70.4 ' 

29 

70.5-72.4 

13 

72,5-74,4 

2 
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We compute the mean, the standard deviation, the skewness, and the fourth 
moment about the mean of this sample: 


mi = 67.99 
m 2 = 5 415,2 

m 3 = -1 549,872 
m 4 = 71.615,158 

From Theorem VII, 

= 67.99 

From (37) and (39), we obtain 

= -.099,833 
= 5.328,067 

Substituting into (42), we have 


<T. = 2.327,058 

as-. = -1.229,91 
a4,= 2.442,17 


Mux 

= 67.99 



Ik.Mx 

= 048,000,603,6 

O'Ux = 

.219,09 


= 0 

“3 V* = 

0 

Ml.M# 

= .006,898,429 

Otl.Ma, = 

2.994,03 


= 0 

Cti'Mx = 

0 


= 001,649,027 

ae-Mx = 

14.910,37 


The coefficients of Charlier's Type A Function (3) are as follows : 

Cs 


3! 

4! 


= 0 


= - 000,250 


= 0 
5! 

6 ! 


= 000 , 000,1 


From the values we are justified in assuming that Mxis normally (fistributed. 
We may now ask ourselves concerning the probability that the mean of the 
parent population, M*, from which this sample is selected, exceeds 68.5 inches. 


t 


•Mf — Mjtfj _ 68.5 — 67 99 _ g 3278 


ffMj. 


.21909 

= / V«) dt = .009962 

}i.3tn 
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Let US now come back to investigation of the general case for the distribution 
of the hypothetical means of the parent population. Because there is no definite 
relation between the values of r and s, except r ^ s, and because, by assump- 
tion, our parent population is normal, is a function of r and s (22) ; that is 

_ 2s(r — l)(s — r — 1) 

s(r — l)(s — r — 1) — ^(s — r)(s — 2)(s — 3) 

Consequently, it is necessary for us to investigate for different values of 5** with 
respect to various combinations of r and s before we can tell which Type of 
Pearson's Curves will best fit the distribution of the means of the parent popula- 
tion. Hence, Table V: 

TABLE V 


Relation of the Values of with Various Combinations of r and s 


j 

V ^ 100, 


- . 0020 


s = lOr \ 

r S 50, 


- 0040 


* 

ir S 10, 

5.. & 

-.0189 


1 

[r g 100, 




,s = 5r ^ 

r ^ 50, 




i 

1?- S 10, 


- 0397 


1 

[r ^ 100, 




s = 2r ^ 

S’ ^ 50, 





[j- k 10, 


-.1118 


s = r + 1, 

r = any finite value. 

Szx — 

0 

s = any finite value. 

r = 1 

^2* — 

0 

S — » CO, 

r = any finite value. 

fizj = 

0. 

Prom the above table we observe: 





1) For an infinite normal parent population, the frequency distribution of the 
hypothetical means of the parent population is normal, because both as at* and 

are equal to 0 (See Part I, Section HI). 

2) For any finite, normal parent population, if ?■ = 1, the frequency distribu- 
tion of the hypothetical means of the parent population is normal. 

3) For any finite, normal parent population, if a sample r = s — 1 is chosen, 
the frequency distribution of the hypothetical means of the parent population 
is normal. 

4) For any finite, normal parent population, if s is equal to 5r or more and at 
the same time r is at least equal to fifty, the normal curve is a fair approximation 
for the distribution of the hypothetical means of the parent population. 
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5) For the other cases in which \ 5,^, | is not negligibly small, we ought to make 
further investigation. 

Now, to carry out further investigation for the cases where i dz,, \ is not very 
small, we need only look back to formulae (43), from which we observe that : 
«4 itf* — 3 < 0 for s 5^ r + 1, r 1, or s does not approach infinity. 

Because of the fact that aatAfx = 0 and a 4 < 3 is the criterion for Type 11,1® 
we conclude that Type II will be the best fitting curve for the cases mentioned 
in 5) above. To obtain this Type II curve we proceed as follows: 

Let the equation of the curve associated with the distribution of the hypo- 
thetical means of the parent population with which we are concerned be 
y = PmM- Then 

I dy _ a — t _ a — I 

y di bfl -f- bit -f bit^ — b 2 (i -j- R) (E — i) 

where 

7? =: - hi ± Vbl - 4bobi 

2bt 


By proper substitution with the formulae in Part I, Section III, we obatin 

(45) 


p _ — «3 Ws ± ~ 45zi(2 4- 5sx) 

^ C 




1 


since as .v, = 0 from (44) 


For the same reason a = 
may be rewritten as 


— «3 Mx 

2(1 + 20 


0; therefore the differential equation 


I dy _ I 
ylt ~ b 2 - Py 


from which we obtain 


(46) 


y = ya (R'^ - fY where O' = “ ^ 


Imposing the condition that the total area under the curve be equal to unity, 
we set 


1 = 



ydt = yo 



t2)5 dt 


Elderton, W P., op. cii , Table VI, opposite p 46. 
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Substituting < = — ii + 2Rti, we have 

1 = j'o (2B)“»+i ju«(l - ny du 

= i/o(2fi)=«+’ /3(? + 1, g + 1) 

. _ _ 1 r(2g + 2) 

(2fi)2m • r(g + 1) r(Q- + 1) 

r(25 + 2) 


hence 


(47) 


y = 


(2fi)*^+ir(9 + 1) r(^ + 1) 

1 r(2g + 2) 


{m - «“)» 


^ 1 r(2g + 2) A _ y 

V2q + 3 ' r(g + 1) r(g + 1) V 2q + 3/ ’ 


where q may be expressed in terms of r and s by means of (46) and (22). Thus 

/.(ON „ _ 1 ^ _r(s - r)(s - 2) (s ~ 3) - 5s (r - l)(s - r - 1) 

(48) s - _ - - 2 2.(r_l)(s-r-l) 


To sum up: In describing the distribution of the hypothetical means of a 
parent population from which our sample is chosen, we have the following 
theorems: 

Theorem XI. The frequency distribution of the hypothetical means of an 
infinite, normal parent population is normal 

Theorem XII. The frequency distribution of the hypothetical means of a 
finite, normal parent population is normal if r = s — 1. 

Theorem XIII. The frequency distribution of the hypothetical means of a 
finite, normal parent population is very nearly normal if s is equal to 5r or more 
and r is at least equal to fifty. 

Theorem XIV. The frequency distribution of the hypothetical means of a 
finite, normal parent population is according to Type II for the cases in which 
I j is not negligibly small. 


Section IV. Probable Error op the Mean 

To measure the fluctuation of a sample mean from the true mean of the parent 
population, it is customary to use the term “probable error’’ to denote the 
expression: 

(49) Em = 0 6745 ~ 

V T 

where ff I is the standard deviation of the parent population. As the true value 
of ffx is n ot known, it is the common practice to substitute for it the value 

^ ®'x is the square root of the expected value of the sample 
second moment. 
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Therefore (49) is rewritten as 

(60) Eu = 0.6745 - 7 =^ 

vr — 1 

Still, it should be noted, this expression is an approximation. Now from our 
theory of inverse sampling, as far as a normal parent population is assumed, 
we have obtained for the probable error of the mean 

(51) Em = 0.6745 

Vr + 2 

where a, is definitely the standard deviation- of an oljserved sample. Although 
for large r, (50) and (51) do not differ much, yet (51) is obtained directly in 
terms of the standard deviation of an observed sample. 

To illustrate, consider the same sample of the heights of 100 freshman students 
(See Table IV) as obtained from an infinite parent population. Since the mean 
is 67.99 and the standard deviation is 2 327058, the probable error of the 
mean is 


Em ^ 0.6745 X 


2.327058 

vio2 


.1554162; 


that is, Ms = 67.99 ± .1554152, which shows that the chances are even that the 
true mean of the parent population lies within the range 67.834,584,8 and 
68.145,415,2. 


Section V. Distribution of the Hypothetical Variances of the Parent 

Population 

Recalling the fact we have stated in Part III, Section II, that the considera- 
tion of the distribution of the second moments of samples about the most 
probable value of the mean is equivalent to the consideration of a distribution 
of sample means drawn from a parent population yz, yi, yz, • • • TJi, where = 
(a;. — my since in a normal parent population M* = rrii [See (24)] we can write 
down in perfect analogy with (12) and (14) 


(52) 

Now 


Mn. p — ( 1)" Mn 

Mp = m 


^n‘p — — Mn, 




ix{ — wii)® 


N 


y-n p,.* 


since we have assumed the mean of the parent population to be its most probable 
value, i.e., wii. Hence by virtue of (52) and (34), the frequency distribution of 
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the hypothetical variances of the parent population, which is assumed to be 
normal, is characterized by 


(53) 


_ _ _ _ i /2(s - r) _j _ ^ /2(s - r) 

<rz, y _ 1 ) y ris ~ 1) * 


since we assume the most probable value of the variance of the parent population 
to be its variance. 






2(s - 2r) / 2(ri~T) 

s — 2 r{s — r) 


«4 llfx — 3 = ai:iy — 3 — 


12 (s — 1) (s^ + s — 6rs + 6r^) ~ 6s(r — 1) (s — r — 1) 
" r(s — r) (s — 2) (s — 3) 


For an infinite parent population, i.e., s — > oo, we have 


= m 



Now if we can find the equation of the curve associated with the distribution 
of the hypqthetical variances of the parent population, we shall be able to 
ascertain the probability that a vanance lies between certain specified limits 
after a sample is drawn from the parent population. 

For illustration, we will use the same sample of the heights of 100 freshman 
students (See Table IV) as selected from a parent population of 1000. 

We have s t= 1000, r = 100 
mi = 67 99 

= 5 4152, or V, = 2.327058 

From (37) we compute 

5,„ = -.0098 

As I I is negligibly small, we may be justified in considering fii-x to be 
distributed according to Type III (Part I, Section III). 

It follows from (39) that 

Afc* =5.32 
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We compute the moments of the distribution of the hypothetical variances in 
accordance with (53). Thus 

= 6 4152 
= -556 

«3:p,=.= .239,946 
= 3 055,75 

If we now wish to ascertain the probability that the variance of the parent 
population lies between }k.x = a = 5.5 and jh-x == i ~ 6.5, we first convert a, h 
into standard units such that ta = .1525 and U — 1.9511 and then evaluate the 
following integral 


4 



But this step is now not necessary since we have access to Tables of Pearson’s 
Type III Function.*’ Hence we find from this table our desired probability. 

P = .39146 

In the above numerical example, we are justified in using Type HI because 
I Sxji I is negligibly small. But for the general case, however, we ought to make 
further investigation concerning the values of . 

TABLE VI 


Relation of Values of vnth Various Combinations of rands 



1 

r 

^ 100 

6,, fe - 


5 = 

lOr \ 

r 

S 50 

Sx, ^ - 

0194 


1 

:r 

S 10 

^ - 

.0859 


1 

fr 

S 100 



S = 

5r -j 

l’’ 

^ 50 

S.y ^ - 




[r 

S 10 

Sxy ^ - 

.1983 



[r 

^ 100 

Sxy ^ - 

.0518 

5 = 

2r -I 

r 

O 

All 

Sxy ^ - 

1073 


1 

[r 

^ 10 

Sxy ^ - 

.7642 


s — > 00 , r = any finite value, = 0. 


Elderton, P E,, op cit., p. 90. 

” Salvosa, L. R., Tables of Pearson’s Type III Functions, Annals of Mathematical 
Statistics Vol. I, No. II. 




94 


CHING-LAI SHEN 


Recalling that 5z„ is a function of r and s such that 


(s — 3){4(a — 2ry(s — 1) + 2r(8 — 2)^(s — r)} 

(s - 2) {2(s — 1) + s — &rs + 6r^) -f r{s — r)(s — 2) 
(s — 3) — s{r — 1) (s — r — 1) } 


we construct Table VI of for different combinations of s and r. 

From Table VI we observe the following facts . 

1) For an infinite, normal parent population, the distribution of the hypo- 
thetical variances of the parent population is according to Type III. 

2) For a finite, normal parent population, if s is at least equal to 5r and r 
at least fifty, the distribution of the hypothetical variances of the parent popula- 
tion is very nearly according to Type III. 

3) For the other cases in which 5*, is not small but negative in sign, the 
distribution of the hypothetical variances of the parent population needs further 
investigation. 


a* 

From Part I, Section III, k = .. ; and since we know that S is always 

4o(2 + 0) 

greater than —2, therefore whether k is positive or negative depends upon 
whether S is positive or negative. 

Now from Table VI we observe that seems to be always negative; hence k 
is negative. In accordance with the criterion for fitting curves, the frequency 
distribution of the variances of a normal parent population in such cases is 
according to Type I, which takes the form:*® 


(55) 

where 


y = 




1 


(Ri - 


« - R,)”"(Br - tr 


^ tt — R2 ^ a — Ri 

~ ba(R2- Ri)’ ~ hiRi - R 2 ) 


Ri, R 2 are the positive and negative roots, respectively, of the equation bo -f 
hit + = 0 and can be expressed in terms of'the first four moments : 

Ri R = — ± — 48(2 -f 5) 


We may sum up the foregoing in the following theorems ; 

Theorem XV. The frequency distribution of the.hypothetical variances of an 
infinite, normal parent population is according to Tyrpe III. 

Theorem XVI. The frequency distribution of the hypothetical variances of a 
finite, normal parent population approximates to Type III Curve if r and s are 
of such combinations that 1 1 turns out to be negligibly small. 

Theorem XVII. The frequency distribution of the hypothetical variances of 


” Elderton, W. P., op, ciL, p 64-. 
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a finite, normal parent population is according to Type I in case that is not 
very nearly equal to zero and is negative. 


Part IV. Inverse Sampling Associated with a Parent Population Distributed 
According to Pearson’s Type III [Inunction 

Instead of a normal parent population as we have assumed throughout our 
discussion in Part III, we shall assume in this part a parent population which is 
distributed according to Type III. Therefore, besides the distribution of the 
hypothetical means and that of the hypothetical variances of the parent popula- 
tion, the distribution of the hypothetical third moments will also be considered. 
We shall carry out our discussion in practically the same way as we have done in 
Part III. 


Section I. Most Probable Value of the Mean of the Parent Population 


We have already obtained a general expression for the most probable value of 
the mean of the parent population: 



s — 2r (TxOii-.x 
r(s --2) 2(1 4- 25J 


where as before 






_ — 3«8 — 6 

-|- 3 


But we are now concerned vdth a parent population which is distributed accord- 
ing to Type III. 

Since the recursion relation of the moments of Type III distribution is of the 
form 


(56) a„+i = n ^a„_i -f ^ j 

ui = 3(1 4- t) where 7 = -^ 

q!6 = 2a3(5 3 t) 

at = 6(3 137 67 ®) 

a ^ = 803(35 -j- 777 + 3O7*) 

08 = 7(15 4- 1707 4- 26I7' 4- 907^) 

09 = 403(315 4- 16527 4- 20077* 4- 63O7’) 

010 = 9(105 + 245O7 4- 84357= -f 86687* 4- 25207*) 

011 = 5 o 3(3466 4- 352667 4- 9197I7* 4- 829627 * + 226807 ^) 

ou = 11(945 4- 393757 + 2522457* + 5377777* + 4374907* + 1134007*) 

etc. 



% 


CHING-LAI SHEN 


it follows from (5) that for a Type III distribution of the parent population 


(M., = M. 


(57) 


8 — 2r / s — 1 

« 3 .., - y «..» 

^ . (s - l)(s* + s — 6 rs 4 - 6 r“) al.^ 
r(s - r)(s - 2)(8 - 3) ' 2 


6 g(r — l)(s — r — 1 ) 
r(s — r){8 — 2)(s — 3) 


Therefore for the most probable value of the mean of the parent population, 
we have the same form as ( 20 ) : 


Mx ~ mi 


s — 2r (Txaz-.x 

2(1 + ' 


except now instead of (17) 
(58) = 2 - 

04 !«® 4 3 


2 _ - 3)(2(a - 1)(8 - 2 r)‘‘«^,, 4 ^r(s - 2 )^(s - r)} 

(s — 2 ){(« — l)(s'“ 4 5 — 6 rs 4 5J'^)a8:» 

4 4r(s ~ 2)(s ~ 3)(s — r) — 4s(r — l)(s — r — 1)} 


We observe that if 03 ;* = 0, this comes back to the case of normal parent 
population which we have already treated in Part III. 

But if 8 00 while as;* is finite, then 5,,. = 0 . Therefore, for the limiting 

case, i.e,, when the parent population is infinite, we have 


(59) 


Mx = mx — — (Tx oi8i 
2 r 


Z 


Since <r* and « 8 :* are not known, we impose the condition that they assume 
their best approximated ‘most probable values’ respectively. Hence, we rewrite 
(20), (59) in the following forms; 


(60) 

where now 


Mx = mi — 


s — 2r Ox asir 
r(a - 2) 2(1 4 25.,) 


(60b) 


(s - 3) {2(8 - 1) (8 - 2ry- 4 8r(s — 2)^ (s - r) } 
(s — 2 )((s — 1 ) (s* 4 s — 6 rs 4 6 r*) as,, 

4 4r(s — 2) (s — 3) (s — r) — 48(j* — 1) (s — r — 1) } 
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and for the infinite case 

(61) Mx = mi — ^ ffx as;* 

2r 

So we state our theorem: 

Theorem XVIII. For a parent population which is distributed according to 
Type III, the best approximated 'most probable value’ of the mean is the mean 
of an observed sample from it minus a correction factor which is a function of 
r, s, ffx, and aa-*- 

It is also interesting to note that when s = 2r and then M = mi. 

Section II. Most Probable Value of the Standard Deviation of the 

Parent Population 

We consider, as we have done in Part III, Section II, ,Cr possible samples, 
each consisting of r variates chosen from a parent population s. The second 
moment of each sample computed about the most probable value of the mean of 
the parent population may be written as 

zi=i{(xi- MxY + (a:a _ + , . . + 

r 

z, = i{{x,- MxY-V (xa - + . . . + (a:,+, - J0-.)’} 

T 


~ r ^ + • • • + (x, — M,)*} 

If we write (x, — M*)® = y., the above may be considered as a distribution of 
sample means drawn from a parent population yi, yt, yt, ■ ■ • y,. Therefore, 
as (27), 

ynxv = ikn-x “t" ^ ^ fkn-l.xiMx — Mx) “t" ^ 2 ^ yin~-i'.x{Mx — Mx)^ 

+ w„_, xiMx -^xy+ ■■■ + iMx - iS"*)'" 

When we impose the condition that the most probable value of the mean of 
the parent population be its mean, then the above yields 


Mn V ”*■ M2n.:« 


/ll.y = My = fiix 
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Consequently 

m, = - (J) n-,; M. + 


fiik-x 


+(-l)‘MS 

{k\ . _ (k\ 

I I I Wi-iiu M2 I I 2 ) 

+ 


-2 

y^2k-^‘X M2 « 



Mfc— 2.1/ 


2 

y 


Now from the fact that we assume a Type III distribution for our parent 
population, therefore we have 


(62) 


ik.v = M4* — Ma I = (^7 + 2)a^ 

M 3 :i/ = Me;x — 3/24,1 M2 * + 2/42, ^ = ( 307 ^ + 667 + 8)0-® 

M4.» = M«;x — 4/26.., M2 X + 6M4 .x M2 I — 3m2 ^ 

= (6307^ 4- 17077* + 9487 + 60)(r* 
etc. 


Substituting (62) into (26), we have 
(M,, = (T* 


(63) 


<r>v 




s — r 


(37+2) 




r(s — 1) 

s — 2r . / s — 1 3 O 7 * + 567 + 8 


S-. 2 
q _ (s 

04:*, — 6 = — 


V r{s^r] 


r) (37 + 2)=/“ 

1) (s^ + s - 6rs 4- 6r2) 6307^ + I68O7* + 9127 + 48 


r(s — r) (s — 2) (s ~ 3) 
6s(r — 1) (s — r — 1) 


' (37 + 2)* 


r(s - r) (s - 2) (s - 3)' 

For an infinite parent population, the above yields by allowing s — ♦ « 


'M.y = <rl 


2 

O’Zy A/ 

/37+2 

y 

r 

1 

307 ^ + 667 + 8 

0!3iZ|/ — " /-■ 

V r 

(37 + 

Ot-.zy — 3 = 

. 

1 6307 * + I 68 O 7 * + 9127 + 48 

r (37 + 2)=“ 


(64) 
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In accordance with (38), we write 


(65) 





s -2r / s - 1 307^ 4-567 + 8 

s - 2 y r(s - r) (87 + 2)3'- 
2(1 + 25,,) 


It follows from Theorem IV that for the mode of the standard deviation of the 
parent population, we have 


( 66 ) 


X) (a: — -Ml)® i2 

*— C g. 

r 

]/ 


s — 2r / s ~ 1 3O7® + 507 + 8 
J~2 y rjs - r) ~ (37 + 2)1 
2(1 + 25.„) 


where 


(67) 


— 6 

«4;»y + 3 


which is a function of r, s, and aa,®. 

Assuming the best approximated ‘most probable value’ of as,* for aa,* and 
remembering that 


iif* = wi — 


s — 2r &x&i.x 
r(s - 2) 2(1 + 2U ’ 


we write (66) in the form of 


nh + 


(1 + 2lj5 


a 2 _ „ 3O7® + 567 +8 *2 
m + 2)(1 + 25 ^ 


( 68 ) 


where 


^ X 


m2 

30^® + 56f + 8 2g^ 

^ (3f + 2)(1 + 25.,) (1 + 2CJ® 


_ s — 2r 
^ ~ 2r(s - 2) ’ 

= (60,5) where aa,* is replaced by aa;* 


Sxy = (67) where aa,* is replaced by as.® 
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We rewrite (68) in the abridged form; 


(69) 

or 

where 


A* — 


tth 


X, r, s) 


(ft 

<^(«3 I, f, s) 




r, s) = yj/ 


1 + 9 


29^y 


30y^ + 56 y + 8 

(3y + 2)(1 + 2SJ (1 + 25^2 


and state our theorem : 

Theorem XIX. The best approximated ‘most probable value’ of the standard 
deviation of a parent population which is assumed to be distributed according to 
, Type III is equal to the standard deviation of an observed sample of it, multi- 

For an infinite parent population, 3 = = 0 and 

Zt 

6 2(630f + 1680f^ + mj + 48)(37 + 2) - 3(30y^ + 56^ -h 8)° 

(37 -h 2)[(6307' + 1680^2 912 ^ + 48) - 6r(3i -t- 2)^“! 

Theorem XX. The best approximated ‘most probable value’ of the standard 
deviation of an infinite parent population which is assumed to be distributed 
according to Type III is equal to the standard deviation of an observed sample 

of it, multiplied by ^ , 

hm (t>(aa-x, r, s) 


Section III. Most Probable Value of the Skewness oe the Parent 

Population 

Let us again consider ,0, samples, each consisting of r variates chosen from 
a parent population s. The third moments of each sample computed about the 
most probable value of the mean of the parent population may be written as 

Zi = ~ {(xi ~ Mx)^ + (x2 — MxY + ■• -t- (a;r — iO"*)®} 

(70) Zlt = ^ {ix2~ MxY -b (rj - SixY -b • • • -f (Xr+l - 



~ { (®«-r+l — M i)® "b (3:«_r+2 


MxY + ■■■ +ix.~ MxY] 
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A 

If we write (a:, — M^r = w^, the above may be considered as a distribution 
of sample means drawn from a parent population tJ)i, Wg, • ■ ■ w^. Conse- 
quently in accordance with (5), we have 


(71) 


M.,„ == 


~ ^ r{s - 


1 ) 


O’ w 


^3 zn) 


s — 2r 
s — 2 


y r(s-r) 


as. 


a4',tu> ^ 

\ 


(s - l)(s2 -b s - 6rs + 6r-^) , ,, 

r(s - r)(s - 2)(« - 3) ' 


6s(r — l)(s — r — 1) 
r(s_l)(s-2)(s-3) ■ 


Let us write the analogous form of (27): 

= ^ + M,, ~ Mx) 


Mn t4> 


N‘ 


(72) 


:) 


+ + . . . + [m, _ 


Imposing the same condition as before that Mx assumes its most probable 
value (i.e., M* = Mx), then (72) becomes 


(73) 


— M3n z 

l^l,w ~ ~ P'^x 


The fcth moment of the distribution of w about its mean will then be 

52 (^ ~ ^«')* /A M 

PhlW — Mfc W \ly 

+ + (_1)^M^ 

{k\_ . ,{k\. 

— fiak.x “ I 2 I » /i3:* "r I 2/ 


-2 

}•« Pz 'X 


+ (-D^'aJ 


(74) 
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Since we assume a Type III distribution of the parent population, we have 
in accordance with the recursion relation (56) 

ilftu “ MS'a: “ O' ^ 

ik'.w — fii'x — jj-l.x ~ (15 + 687 -|- 307 ^) ol 
(75) Ms «) = ^9 I — S/Ze X MS:® + 2m3 j 

= «3 . (1215 + 64177 + 79387* + 2520y) 4 

piui — — 4m9;® Ms'x -h 6 m 6,® Ma.x — Sms X 

= (10395 + 4232257 + 272259972 + 58516837* 

+ 47922307* + 124740070 4' 

Substituting into (71), we have 
f Miu, = hx = asxol 

(307* + 637 + 15) 

_ s - 2r , /' S - 1 as:;, (1216 + 64177 + 79387* + 25207*) 
s - 2 y Ks - r) (15 + 637 + 3O7*)*'* 

, _ (s — 1) (s2 -j- s — 6r« -f 6r*) 

“ ,-(s -r)(s- 2) (s - 3) 

9720 + 4175557 + 27079927* + 58403437* 

+ 47895307* + 12474007* 

(15 637 + 307*)* 

6s(r — 1) (s — r — 1) 
r{s — r) (s — 2) (s — 3)’ 

Allowing s — > 00 , we have for an infinite parent population 

~ p3:x ~ OI3 X O ^ 

= (307* + 637 + 15) 

(77) , _ 1 as.* (1215 + 64177 + 79387* + 25207*) 

Vr (15 + 637 + 3O7*)*/* 

9720 + 4175657 + 27079927* + 58403437* 
g _ 1 + 47895307^ + I2474OO7* 

)’ (15 + 687 + 3O7*)* 
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The best approximated 'most probable value’ of may now be written 
after the same fashion as in the preceding cases : 

(78) «„ 2(r+2U 

where 


2ot4 ?ui — So;?., — 6 


“4 "b 3 


Since 


T.ix- 

r 


1 

r 



X — mi + g 


$'x aa a. Y 

1 + 2 IJ 


[from (60)], 


and since we assume the best approximated ‘most probable values’ of the 
standard deviation and the skewness for the standard deviation and the skew- 
ness of the parent population respectively, we obtain from (78) 


ff* aa:* = Wa -h 3wi2 


A A A 3 ^ 8 

^8 X 1 3 ^3 a 

1 -b 2Sij. (1 -j- 25jj)® 


&3-.X (1215 -b 64177 -b 7938-?^ + 2520-?®) ,3 
^ (1 -b 2dJ (15 -b 63^ -b 30-?0 


The change of /is.* to cl aa,* involves a systematic error although it is small. 
Again by proper substitution of (69) we have 

Va _ 3 J_ Q * /T <^3 X 

r, s) * ’ “ r, s) (1 -f 25^J 

3 a3 

4-^3 Tj “j; . ? ^ 

aa a, (1215 -f 64177 -b 7938^^ -f 2520-^0 
^ <t>^ i&3 X, r, s) • (15 -b 63f + 30^9 (1 + 2§ J’ 


Solving for aa.* , we have 

^ 3g<f>Ha3.x,r,s} 27 

(1 -f- 26jj,) (1 “b 

g (1215 -b 6417-? -b 79387=’ -b 2520^) '] 

( 14 - 2 D (15 -b 637 + 307=“) J' 

Since the right member of (79) is a function of as.x, r, and s, therefore the 
most probable value of az-.x may be approximated when we are given s, r, and 
the skewness of an observed sample. As it is an algebraic equation of high 
order in az-.x and is so much involved, even approximation presents practical 


(79) 


az , 


0!S.x 


<b^ (a.z-x . r, s) 
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difficulty. However, if once at * is approximated, ct^ and can be easily ob- 
tained from (60) and (68). 

Theorem XXI For the best approximated ‘most probable value’ of the skew- 
ness of a parent population which is assumed to be distributed according to 
Type III, we must approximate it from equation (79), in which the skewness of 
an observed sample is expressed as a function of s, r, and the best approximated 
‘most probable value’ of the skewness of the parent population. 

To construct a table for the best approximated ‘most probable value’ 03., 
corresponding to asu for particular values of r, s, we should first reverse the 
process by assigning different values of atx so as to obtain asj*; then by the 
way of interpolation, we shall be able to obtain as * for a particular at,^ . 

TABLE VII 

Belation of the Sample Skewness and ike Best Approximated ‘Most ProhahleValue’ 
of the Parent Population Whose Distribution is According to Type III 


(s — > 00 , r = 100) 


«3 « 

A 

Ota T 

.1 

0784 

.2 

1568 

.3 

2373 

4 

.3164 

.5 

.3969 

.6 

.4776 

,7 

6589 

8 

.6410 

.9 

7239 

1 0 

8072 

1 1 

.8905 

1 2 

.9737 

1 3 

1 0567 

1 4 

1 1392 

1.5 

1.2211 

• 1.6 

1 3022 

1.7 

1.3791 

1 8 

1 4578 

1.9 

1.5355 

2.0 

1.6122 

2.1 

1.6828 

2.2 

1.7609 

2.3 

1.8303 

2 4 

1 9024 

2.6 

1.9670 

2.6 

2.0371 
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For s — » oo and r = 100, we have computed the best approximated ‘most 
probable value’ of as.# corresponding to the values of as;, from .1 to 2.6 as shown 
in Table VII. 

The computation for such a table is laborious because it involves the compu- 
tation of 1,^, bzy, and which are in turn functions of as.,,, and a,;,,,, a 3 ;,„ and 
as and ds;,^ and ds-,,,, respectively. 


Section IV. Distribution op the Hypothetical Means of the Parent 

Population 


Since we have obtained in the preceding sections expressions for the best 
approximated ‘most probable values’ of the mean, the standard deviation and 
the skewness of a parent population which is assumed to be distributed according 
to Type III, we are now in the position to characterize the distribution of the 
hypothetical means of the patent population with the assumption that the best 
approximated ‘most probable values’ of the mean, the standard deviation, and 
the skewness be the mean, the standard deviation, and the skewness of the 
parent population. 

Basing upon the fundamental relations in (15), we write down the character- 
istics of the distribution of the h 3 rpothetical means of the parent population as 
follows : 


(80) 


'Mmx 

= mi 




(TMx = 

= ffzx = 


1 1 

- r 
-1) 




_ 5 — 

2r 

«3 Mx 

il 

1 


S - 

- 2 

Oli.Mx 

II 

CO 

1 

04 




1 ) 


as# 


r(s — r) 

(s — l)(s^ + s — Ors -f 6r'') TSds ,,”1 
r(s - r)(s - 2)(5 - 3) L 2 J 


6s(r — l)(s — r — 1) 
r(s — r)(s — 2)(8 — 3)' 


where <t>(aa,x, s, r) is given in (69). 

For an infinite parent population by allowing s —> oo, we obtain from the 
above: 


( 81 ) 


(Mmx = m 


O'Mx ~ 


I 0-3 Mx ' 


1 O’. 

a/t <^(ds.i, r) 

1 . 

— 7 = 0 ^ 3 . » 

vr 


OCi Ux — 3 — 



where ^(d*;*, r) = lim <^(dj:*, s, r) 
8— *00 
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Since we observe that the moments of the distribution of the hypothetical 
means are expressed in terms of aa.i, it is therefore necessary for us to find the 
best approximated ‘moat probable value’ of the skewness of a parent population 
before we attempt to obtain the frequency function associated with the distribu- 
tion of these hypothetical means. 

Numerical illustration. A sample of 100 weights of freshman students is 
observed and the frequency distribution is given in Table VIII, 

TABLE VIII 

Weights of 100 Freshman Students 


(Original Measurements Correct to Nearest Pound) 


Class Mark 

Frequency 

109.5 

4 

119.5 

11 

129.5 

25 

139.5 

34 

149.5 

14 

159.5 

8 

169 5 

0 

179.5 

3 

189.5 

1 

. 

100 


The first four moments are computed 

mi = 138 3 
= 14 6366 
«3’s = .81099 

ai . = 4 47644 

Now, assuming this sample is drawn from an infinite parent population which 
is assumed to be distributed according to Type III, we wish to find (%) the best 
approximated ‘most probable values’ of the mean, the standard deviation, and 
the skewness of the parent population, and (b) the probability that the mean of 
the parent population lies between — 135 and Mx = 140. 

By interpolation from Table VII, we obtain the best approximated ‘most 
probable value’ of the skewness of the parent population: 

03 , X = .6501 

Prom (69) and (61) we obtain 
^x = 14.5452 

Mx = 138.25272, r) = 1.006279 
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From (81) we have 

“ 138 3 

(7i,x = 1 45452 

0:3 Mx = 06501 

oci,Mx = 3.00633945 

= 0, the distribution of is associated with Type III Function; hence 
for the probability that Mx lies between Mx = 135 and Mx = 140, we again 
refer to Tables of Pearson’s Type III Function prepared by L. R. Salvosa,“ and 
we obtain in this case 

P = 8677592 

Since the determination of the best fit of a frequency curve in general depends 
upon the values of a 3 , a 4 , and k, and since in the present case each of them is a 
function of s, r, and az-x, we are therefore not able to tell the type of curve to 
be used until we know s, r, and 03 1 . 

For the infinite case, however, as we have illustrated Type III Function may 
always be used because 

5^ _ 2a4 z., - 3«1,^, — 6 _ 20:4 — 3 q!3.„, — 6 _ ^ 

‘ “4 Jx d" 3 “4iMx + 3 

holds for all values of as,* and r. We therefore conclude that the hypothetical 
means of an infinite parent population which is itself distributed according to 
Type III is distributed according to Tjpe III. Hence 

Theorem XXII. The hypothetical means of an infinite parent population is 
distributed according to Type III if the parent population is assumed to be 
distributed according to Type III. 

Section V. Distribution of the Hypothetical Variances of the 

Parent Population 

Parallel to Part III, Section V, the distribution of the hypothetical variances 
of a parent population which is assumed to be distributed according to Type III 
can be described. The fundamental relation of Theorems II and III hold: 


M2n*p — M2n zy 
/•tZn-f-l p — — /i2n+l Zy 

(x - M xy 
r 


or ^2n'v — 

Q^2n+l;p ~ — Of2n+l*«y 


But now Mj, = (gee Part IV, Section II) 

r 

(82) + 


= ma + 


2 a2 

(ToOia 


(1 + 2SxxY,hK^.x, r, s) 


[from (60)]. 


” Salvosa, L. R., Annals of Mathematical Statistics Vol. I, No. II, 1930. . 
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Upon the same assumption that the best approximated 'most probable values’ 
of the mean, the standard deviation and the skewness be the mean, the standard 
deviation, and the skewness of the parent population, the distribution of is 
characterized by 


(83) 


Ma,., = m + g^ 


2 a2 


(l + 2Sj»</)»(S8:„r,s) 


= m2<l + 


•» 2 

9 “8;* 


(1 + r,s) 


= |/ (37 + 2) 


]/ ris~ 


(3-? + 2) 


1 ) 


<^®(a3;*, r, s) 




s — 2r 
s-2 


= / s - 1 _ 30 t' + se-f + 8 

s — 2 y r(s — r) 


(37 + 2)» 


o_ , _ (s- l)(s* + s-6rs)4-6}-2)r.. oi 
- 3 - o.,, - 3 - 2) (s_3) 

OsCr — 1) (s — r — 1) 
r{s — r)(s — 2) (s — 3) 

_ (a - 1) («^ + 8 - ^rs + 6r^) f 6307^ + 1680^^ + 9127 + 18 1 
r{s - r) (s - 2) (s - 3) [ (3-f + 2)“ J 

6s (r — 1) (s — r — 1) 


r(s — r) (s — 2) (s — 3)' 


For an infinite parent population, we have 


(84) 


A 2 
«8:* 




"■pCX — ^ 


(S'f + 2) wis 


1 3(>f* + 56^ + 8 


(3^ + 2)»« 

„ _ 1 /eso-?* + 168(lf* + 912-? 4- 48 

1 7;ri ; rn; - 


r { 


m + 2y 


Numerical illustration. Using the same sample in Table VIII, we wish to 
ascertain the probability that the variance of the parent population lies between 
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306.25 and 342.25 From mi = 138,3, a, = 14.6366, as , 
4.47644, we find from (84) 


X ~ 

214 

232,235 


34 

335,74 

^3 Asias 

— , 

.495,311 


3 

463,675,7 

fiiy = 


105,515,6 


From Part I, Section III, 


h.x 


(2 + 


= .276 < 1 


.81099, and ai.j = 


Therefore, the best fitting curve will be Type IV which assumes the form^® 


(85) y = (1 -f. ajS)-”* e-'' 

where 

1 

t being in standard units 

_ hi _ aj 

^ 2b2 25 

2 45o52 — bi 46 (2 4" 5) — a* 


ill 

48* 

1 _ 1 + 26 
252 5 


a + p 
biQ' 


X TT 

2 

e 

1 

g (2m - 2, X) 

F(2m - 2, X) 


y<j is found from Pearson’s Tables for Statisticians and Biometricians^^ to be 
049662. 


Elderton, W. P., op. cit., p. 64. 

Pearson, K,, Tables for Statisticians and Biometricians, Vol. I, pp. 126-142. 



110 


CHINQ-LAI SHEN 


Now the given limits 306.25 and 342.25 of the variance, when expressed in 
standard units, are 

= 2 679,941 
k = 3.728,410 

Therefore the probability that jii-i lies between jxi.g = 306.25 and /jj.s = 
342.25 is 

r t .-3,728, 410 

P = 2/0 I (1 + a;*)"” dx 

Jl ^=.2.679,041 

we find 

m = 11.477,271 
X = 12 940,307 

/ 36343 

(1 + 6-12.940207 tan-i=. 

08767 

By means of Maclaurin-Euler’s Interpolation Formula, P is found to be equal 
to .000,904. 

No definite law can be ascertained before we know as.* because, as we have 
seen, aap,,, and ai/i,.* are both expressed in terms of s, r, and aa.*. We do 
not know the value of fc, which is a determining factor of the best fitting curve 
and a function of s, r, az-a^.^ and a 4 :/i,.*, until we know the values of s, r, 
and as,* , 


Section VI. Disteibution of the Hypothetical Thied Moments oi the 
Parent Population About Its Mean 

Recalling the fact that the distribution of the third moments of sample means 
about the most probable value of the mean of the parent population is equivalent 
to the consideration of a distribution of sample means drawn from a parent 
population, Wi, u' 2 , Ws, • ■ ■ w, where = (a;, — ilf*)®, so we can write down in 
accordance with the fundamental relations stated in Theorems II and III ; 


fXZnip — p'2n‘Xv} 

MSn-f-lip “ M2n+1 «ui 


or 


— ^2n,Zxo 


^27i+1:p 




(x — M 

But here Mp = = — ; and by the substitution of (60), we have 

'’ V ^ 1 + 2 dJ 


mz + 37712 


Id- 
s' <7, aa * 


-b 




(1 + 23*,.) <^>( 03 ;*, r, s) (1 4 - 2di^y <yiaz I, r, s) 


(86) 
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Consequently, with the same assumption that the best approximated ‘most 
probable values’ of the mean, the standard deviation, and the skewness be the 
mean, the standard deviation and the skewness of the parent population, the 
distribution of Jii x is characterized by 






+ 


^3 3 A 3 

9 iTsas 


(1 -f s) (1 + r, s) 


- t / ' 


(5-1) 


(15 + 63t -h 307=) 


r, 8) 


_ n - 2r / fi — I 
“ « ~2 y r(s - r) 

= i / s - 1 a3=»(1215 4- 64177 + 79387= + 2520f«) 

s - 2 y r(s - ?•) (15 + 637 + 307=)^ 


^ _ (s - l)(s= + s — 6rs + 6r=) , 6s(r - l)(s - ?• - 1) 

* r(s-r)(s-2)(s-3) ^ Ks - ?’)(s - 2)(s - 3) 

9720 + 4176557 4- 2707992'f= + 5840343'?^ 
(s - l)(s= + s - 6rs + 6r=) + 47895307'* + 1247400^“ 

r(s - r)(s - 2)(s - 3) (15 + OS-f + 30^)= 

6s(?’ — 1) (s — r — 1) 
r(s — r) (s — 2)(s — 3) ‘ 


For an infinite parent population, we have 


Mi 


WJs + Sma 


O'iCls X 


2r4>(a3 X, r) 


+ 


1 3 •* 3 

<r» «3 

Sr* (j>Ka3 x, r) 


( 88 ) 



+ 63f + 30f^) 

4>\cizx, r) 

aa ,(1215 + 6417-7 + 79387=* + 2520-/^) 
(15 + 637 + 30f=)5 


Ut.X 


9720 + 4175557 + 2707992-?^ + 5840343t 1^ -f 4789530-?* 
1 + 1247400-y® 

r (15 + 63^ 4- 30f )= 


Numerical illustration. Using the same sample in Table VIII, we wish to 
ascertain the probability that the third moment of the parent population about 
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the mean lies between = 3000 and As* - 4000, still assuming an infinite 
parent population from which the sample is drawn 


aa:i = 


,6501 


^iai„r) = 1.006,279 


We find from ( 88 ) 


M;,, = 2558.137,096 
a;,, = 1675 696,37 

-1 187,409,9 
6.127,551,6 
5 ,/ = 0 221,886 

k = 0.714,972 < 1 


Therefore the best fitting curve is Type IV. 

From Pearson’s TaUes for StatisUdms and BiomeirimnSt Vol. we compute 

yp = 000,058,032,3 

The given limits 3000 and 4000 when expressed in standard units are t - 
.263,689 and t = .860,455 respectively. Therefore the probability that /laai 
lies between 3000 and 4000 may be expressed by 

P^yJ (1^3.2)-6mi9g-17 143447 tan*.. 

26.3689 

By means of Maclaurin-Euler’s Interpolation Formula, the answer is found to 
be 267,408,631, 

We make the same remark here as we have made in the preceding two sections. 
That is, since 0:3 and 04 .^,, are both in terms of s, r and we cannot 
determine the value of k which is a function of ^3 p,.,. and 04 p, ,, until we know 
the values of s, r, and ^ 3 ., . Consequently, the curve associated with the dis- 
tribution of the hypothetical third moments of a parent population of Type III 
distribution is not known until we know s, r, and as’x . 


Pearson, K , op. ciL, pp. 126-142 
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ON A METHOD OP TESTING THE HYPOTHESIS THAT AN OBSERVED 
. SAMPLE OF n VARIABLES AND OF SIZE N HAS BEEN 
DRAWN FROM A SPECIFIED POPULATION OF THE 
SAME NUMBER OF VARIABLES 

By John W. Pbetig 

With the Technical Assistance op Mabqabbt V. Lbabt* 

The problem of determining whether or not a given observation may be 
regarded as randomly drawn from a certain population completely specified with 
respect to its parameters is readily solved if the probability integral of that 
population be known. In particular if the population specified be a normal 
population, one may calculate the relative deviate {x — a) /it, where a and it 
are the population mean and standard deviation respectively,' and refer to tables 
of the normal probability integral. The hypothesis that x was drawn from 
this population may be rejected if P is less than an arbitrarily fixed value, 
say ^ .01. Generalizations of this problem may be made in two directions: 
1) May a single observation simultaneously made on n variables be considered 
as randomly drawn from a specified population of n variables? 2) May a 
sample of one variable and of* size N be regarded in its entirety as randomly 
drawn from a specified univariate population? 

The solution to the first problem for the case of sampling from a normal 
population of n variables was given by Karl Pearson in 1908^ as the “General- 
ized Probable Error.” Let 

L (Tiff, Jj 

where o, and Vt are the population mean and standard deviation respectively 
of the variable, and Pi, is the usual cofactor of the element in the row 
and column of the determinant P of population correlation coefficients. 
That is, 

P = \pif\lhj = 1, 2, 3, ,n. 

The probability of an observation yielding a smaller discrepancy than that 
represented by the value of i-e., lying between 0 and x^ may then be 
calculated from Tables of the Incomplete Normal Moment Functions^ The 
tables are entered in terms of (x^)* and (n — 1), and the tabled value multi- 
plied by (2 t)^ or 2 depending upon whether n be even or odd respectively. 

* From the Memorial Foundation for Neuro-Endocrine Research and the Research 
Service of the Worcester State Hospital, Worcester, Massachusetts. 
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The probability of an observation, giving a greater discrepancy is then, the 
complement of this value. Obviously, this latter probability may be obtained 
directly by entering tables of the distribution such as Elderton’s® with n 
degrees of freedom, or through the use of Tables of the Incomplete T-Function^. 

The second problem, limited to the case of sampling from a normal popula- 
tion, was investigated by J. Neyman and E. S. Pearson in 1928®. The observed 
sample may be regarded as a point in iV-dimensional space, where N is the sample 
size. Criteria for the acceptance or rejection of the hypothesis may be asso- 
ciated with contour surfaces in this space, so that in moving out from contour 
to contour the hypothesis becomes less and less reasonable. Frequently, con- 
tour surfaces on which the mean or standard deviation is constant are used for 
the testing of this hypothesis. Such surfaces are deficient inasmuch as they 
are not “closed” contours. Another contour system which appears more satis- 
factory is that of equiprobable pairs of m and s. The latter system in fact 
encloses roughly the same region as do the separate contours for the means and 
standard deviations. These systems are of course dependent on the particular 
statistics chosen to describe the sample and are further limited in that they do 
not take into account the probability of alternative hypotheses concerning the 
origin of the sample. 

Using the principle of maximum likelihood Neyman and Pearson have devel- 
oped a system of contours which is free of the above limitations. The system 
so derived is in fact quite similar to that of equiprobable pairs m and s. In a 
later paper®, these same investigators have shown that this method of maximum 
likelihood does enable one to select the most efficient criteria for the testing of 
an hypothesis. The criterion selected on this basis is defined as 


X = 


Likelihood that sample came from specified population 
Maximum likelihood that sample came from some other population 


= (sV(r2)^/2e-'^/2 


-f (x — a)^ 


- 1 


where a and <r are the population mean and standard deviation respectively, 
and X and s the sample mean and standard deviation. 

X is constant upon certain contour surfaces in iV-dimensional space, and dimin- 
ishes on passing outward. The form of the surfaces is independent of N. It 
is evident that X must lie between zero and unity. When it is close to unity 
we know that it is reasonable to assume that our hypothesis is true, when 
small we know that it is unreasonable. But we must know the probability of 
X less than a certain value occurring when the hypothesis tested is true, so that 
we may control another source of error, namely, that of rejecting the hypothesis 
when it is true. In other words, we must know the sampling distribution of X, 
so that we will reject the hypothesis only when the probability of obtaining a 
smaller value is negligible, say Px ^ .01. Neyman and Pearson were not able 
to evaluate this distribution but they were able to integrate the original density 
function of the population appropriate to N-dimensional space outside of the 
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various X contours. This they were able to do by effecting a transformation 
of the density function and contours to the plane of m and s. These values 
of P\ have been tabled by them'', the tables being entered in terms of N and 
Jc, where 


k = log 




s'* -j- (x — a)* 


— log (sVv^) 


The generalization of either of the above problems requires a criterion to 
test an hypothesis which may be formulated as follows: Given a sample 2) of n 
variables and of size N with means xi, Xi, ■ , x„, standard deviations 
Si, Ss, • • • , Sn, and correlation coefficients ru, r^, • • ■ , ru, r^, • • ■ , r 2 „, ■ ■■ , 
may we regard this sample as randomly drawn from a population t of n varia- 
bles and completely specified with respect to all its parameters? We shall 
restrict our inquiries to the case where w is a normal population. In this case 
the distribution law is 


f(Xi, X2, ■ • , Xn) = 


(27r)”^^xi<ri ■ ■ ■ cr„P* 


where 




where and cr, are the population mean and standard deviation respectively 
of the f*** variable, and P and P,, are as previously defined 
Thus the probability that S has been drawn from tt with its N values of 
Xia (i = 1, 2, • • • , n) lying in the interval x,a ± ^dxra] (a = 1, 2, • ■ • , W) is 
given by 


where 




(2ir)''/VlV2 • • • (TnPK 


UX 


e = 


1 

2P 


n N 

S Pi, s 

t,!-l 0I=.1 




K 

2P 




a,){x, 





dX == fl f[dx 

t-1 a-1 


ta 


The likelihood that S has been drawn from any other normal population, 
such as tt', is given by 
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where 


0 ' = 


N 


2P' i.,7-1 




4- ~a[){x,-a\) 


The population from which it is most likely that S has been drawn is that 
for which C is a maximum. The values of the parameters of this population 
may be obtained by putting 




0 , 


dC 


bC 


9(r, 


7 = 0; (i = 1, 2, ■ . . , n) 


1 = 0; (i, j = 1, 2, . . . , n) 

9Pt, 

These conditions are fulfilled when 

(ij = (Ti = Sj, (i = 1, 2, • ■ • j n) 

p.', = (by = 1. 2, , n) 


So that 


where 


=r 


(2t)"'‘'SiS2 • • • SnK^, 




p = 1 n, 1 ; t, y = 1, 2, . . . , n 

The appropriate criterion to select in order to test our hypothesis is thus 

Cmax Uicrj • • • (r„pij 


where 


" " 2 ■? L s;; 

The equations X = constant represent a series of contours in JV-dimensional 
space. As we move outward from contour to contour our hypothesis becomes 
less and less acceptable. Although we may be confident that the use of this 
criterion will minimize the chance of accepting the hypothesis when it is false 
we must know the frequency with which samples occur outside of a given X 
contour when the hypothesis is true. In other words, we must know the inte- 
gral of C outside of various contours, or else we must know the sampling distribu- 
tion of X. The former is an exceedingly difficult method for n greater than 
unity. Thus for the ease of n = 2 we should have to integrate some such 
expression as 

y— 4 

fcs^““s2~^e“®(l — rjj) * dx^dXidsidSidrii 
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outside of the various contours. Nor have we so far been able to evaluate the 
sampling distribution. We can however give an expression for the moments 
of X and thus reach an approximate distribution. 

Wilks® has derived expressions for the moment coefficients about zero for 
the maximum likelihood criterion that k samples of n variables and of Nt 
observations each have been drawn from the same unspecified normal popula- 
tion of n variables. Thus, 


^:(x) 


k 

~ k 

h » W < „ 

n 

S Nt 

■ n 

(=1 

L AT. J 

1*1 




(1 + ii) - 


i^) 



n 

1 = 1 


r 

J 

;S Nt~i 

V 

L 2 J 

T 

k 

(l + h) S Ni- fl 
(=1 

L 2 JJ 


from which we can write expressions giving the moment coefficients about zero 
for the X criterion for two samples 


Ma(X) = 


nh(Ni+Ui) 

(N^ 4- N,) " 

nhl^t nUNj 

Ni ' iVs ' 


n 


^ j 

r 

J _J 

[N,il+h)-z'\ 

1 

[N^il + h) -i] 

|r 

[Ni -f V* - 

ill 

L 2 J 

L 2 

L 2 Jl 

r| 

\ 



~{Ni + N,) il + h) - f 
2 J 



The limit of this latter expression as iV 2 — > « will be the moment coefficient 
about zero for the X criterion that one sample has been drawn from a specified 
population. Thus 



Various roots of X are distributed to a good degree of approximation according 
to a function of the form 


“ r(mi) Tim) ^ ’ 
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where 

= Ml (mi — M2)/(m 2 — Ml ) ; m2 = (1 — Mi)OTi/mi 

and the value of mI for roots of X may be obtained by replacing h in the original 
expression by h times the desired root. Measures of the skewness and kurtosis 
of this distribution are given by 

Bi — 4(mi - + ma -f- l)/mim2(mi + m 2 + 2)“ 

J ?2 = 3Bi(mi -|" ^2 2) 6(mi -b m 2 d- l)/2(mi m 2 -)- 3) 

A comparison with the true measures of skewness and kurtosis for various roots 
of X as given by 

= M3/M2 I -®2 = M4/M2 

will afford a measure of the goodness of the approximation and the range of 
values of N for which any particular root will be distributed as assumed. 

Investigating the moments for n from one to four and N from three to fifty 
we note that in the case of samples of two and three variables, follows the 
assumed distribution for N from 3 to 15; X^^^ from 15 to 30; X®/"^ from 30 to 50. 
In the case of four variables, X'^^" follows the distribution for N from 5 to 10; 
V-in from 10 to 20; X^"' from 20 to 40; from 40 to 50. It appears likely that 
for higher values of n, for IV small, some such root as X^^®^ or X’^*^ will follow the 
assumed distribution, while as N increases smaller roots will follow it. For 
any value of n, the smallest permissible value of N is (n 1) . 

The probability that a smaller value of X will be obtained when the sample 
has actually been drawn from w, i.e., Px, may thus be obtained by reference to 
Tables of the Incomplete P-Function® with p = mi, q — x = value of the 
particular root of the observed X. We may also get the 1% and 5% levels of 
significance directly from Fisher’s'® tables of “z” or Snedecor’s" tables of “F” 
(— by taking 


ni = 2m2 ; n'i = 2mi ; L = rii/ (ui + riiF) , 

where L is the desired root of X. Linear interpolation will generally sufldce 
except for very small values of N. 

.For the case of N —> « , we have 

n+l 

Lim. mI(X) = (1 + h) 

K-*oo 

Thus the quantity (—2 log X) will be distributed in the distribution with 

n 

<S { degrees of freedom. 
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A table of the 1% and 5% levels of significance for n equal one to four, and 
values of N from five to « is given below 


and 1% Levels of Significance of “X" 
— N — 


n 


5 

10 

15 

20 

30 

40 

60 

00 

1 

5 % 

.025 

.037 

041 

.043 

.045 

.046 

.047 

.050 

1 % 

.003 

006 

008 

.008 

.009 

.009 

.009 

.010 

2 

6% X 10-» 

.046 

.173 

234 

.269 

.308 

.330 

.343 

.392 

1% X 10 -» 

026 

.168 

,260 

.305 

.372 

409 

.428 

.525 

8 

5% X 10-“ 

.001 

036 

.072 

.097 

.125 

.143 

.165 

.211 

1% X 10'* 

.000*- 

.019 

.047 

076 

.101 

.117 

,128 

.194 


6% X 10-‘ 


.026 

.106 

.174 

.295 

.356 

418 

.710 

*4 

1% X 10'* 


007 

.040 

.075 

.145 

.185 

.221 

.466 


A check on the accuracy of the method of approximation used may be obtained 
by comparing the values of P\ for the case of n = 1 with the exact values given 
by Neyman and Pearson. For n = 10, is distributed as assumed with 
mi = 9.0562, m 2 = 0;9987. For the case of (x — a)/ a = 0.2, s/v = 1.2, we 
find k = 0.48439, X^" = .94395. From the Tables of the Incomplete B-Func- 
tion we find Px = .5936, from Neyman and Pearson’s tables, .5936. 

No studies have been made on the extent of deviation from normality per- 
missible for the application of the test. There is no reason to doubt, however, 
that as much deviation is permissible as in the case of the univariate X. From 
theoretical considerations and from sampling studies Neyman and Pearson con- 
clude that the univariate X technique holds for deviation from normality to 
the extent of ±0.5 for Bi and 2.5 to 4.2 for 8%. 

We are confident that this generalized X techmque will be found useful in 
biological research. If the 71 variables were uncorrelated we would be able to 
test whether the sample had been drawn from the population of n variables by 
successive applications of the univariate X techmque and then combming the 
resulting probabilities. In general, however, there -will be some correlation 
between the variables, however slight. The method here proposed will take 
account of all possible intercorrelations, and consequently all multiple and 
partial correlations. 

Now, if P\ is less than some arbitrarily fixed value, say ^ .01, we may decide 
which variable or variables contributes most to this result, by performng 
simpler X tests. It may be due to one or more of the means, standard deviations. 
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or correlation coefficients. As may often be the case, it is not due to any one 
factor but to contributions from all of them. That is, all possible factors 
tested separately might show a fairly reasonable value of P, but if all the 
separate values are combined somehow, as by means of this X method, the 
resultant P may be too small. It is in such problems that this technique should 
provide valuable information. 

In case k samples of n variables are available it should be possible to deter- 
mine whether all of them have come from the same specified population of n 
variables by performing k X tests and combining the separate values of Pi. 
Such a hypothesis may best be tested, however, by a further extension of the X 
theory which the writers are at present investigating. 

The following problem is chosen to illustrate the computations involved in 
the application of the test, Many of the investigations pursued at the Wor- 
cester State Hospital attempt to differentiate between schizophrenic patients 
and normal controls. In one such type of investigation various blood constit- 
uents were determined, namely, Urea N 2 (mg./lOO cc.), Uric Acid N 2 
(mg./lOO cc.), Creatine Nj (mg./lOO cc.) for a sample of twenty-five schizo- 
phrenic patients. Previous investigations on these same variables for a large 
series of normal controls yielded constants which for the purpose of the 
example may be considered as the population parameters. Past studies on 
these variables have not shown any marked degree of non-normality for the 
various distributions. 

These variables are designated as 


1 = Urea Nj ; 2 — Uric Acid N 2 ; 

The parameters of the population are given by 
ai = 16.03 ; aa = 1.40 ; 

= 20.268 ; 4 = 0.029 ; 

P 12 == .3075 ; Pis = .1232 ; 

The statistics for the sample of twenty-five are 
x\ = 15.56 ; X2 = 1.42 ; 

si = 10.486 ; si = 0.043 ; 

ri2=-.0161; ri 3 = .0925; 


3 = Creatine N 2 


03 = 1.25 
xl = 0.025 
p23 = .3853 

= 1.25 
= 0.025 
r23 = .2174 


None of these statistics differs significantly from the corresponding parameters. 


R = 0.9443 ; P = 0.7710 ; 

Pn/P = -0,3373 ; Pi,/P = -0.0061 ; P^/P = -0.4506 ; 

Pii/P = 1.1045 ; P22/P = 1.2773 ; P33/P = 1.1744 

w = 12.5 (0.3802) = 4.7531 
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(4 si si R/al 4 4 P) = 0.9001 
log X = 12.5 log (0.9001) - 4.7531 log e ^ 3.3641 
X = .0023 

Since the 5% level of significance is about .0001, we thus conclude that the 
patients are not differentiated from the control population with respect to these 
variables. 
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ON CONFIDENCE RANGES FOR THE MEDIAN AND OTHER 
EXPECTATION DISTRIBUTIONS FOR POPULATIONS OF 
UNKNOWN DISTRIBUTION FORM 

By William R. Thompson 

About the commonest situation with which we are confronted in mathematical 
statistics is that wher6 We have a sample of n observations, (a:,}, which is 
assumed to have been drawn at random from an unknown population, U, with a 
zero probability that any two values in the finite sample be equal; and we 
desire to obtain from this evidence some insight as to parameters of the parent 
population, U. If further assumptions are made as to some of the parameters 
or the form of U, there may result a gain in power in testing other given hy- 
potheses or establishing confidence ranges for particular parameters, but at an 
obvious sacrifice of scope in application. Insistent problems involve estimation 
of mathematical expectation that in further sampling we shall find x lying within 
a given interval, or similar expectation with regard to parameters of U such as 
the unknown median. It might seem that, without further assumption, all 
we should claim is that it is possible to draw from U the sample actuahy ob- 
served. A mere description of the experience may well be considered the 
observer's first duty, but a restriction to this would leave entirely unused the 
quality of randomness which has been assumed. What additional statements 
as to 1/ may be appropriate in view of this randomness are our immediate 
concern; and the object of the present communication is to show how we may 
obtain such expressions in the form of mathematical expectations, and to 
present some results. Widespread applications to problems of estimation of 
normal ranges of variation or specific confidence ranges and comparisons of 
sample reflections of possibly dififerent populations are immediately suggested, 
and a new foundation is offered for the study of frequency-distribution from the 
point of view of Schmidt.^ 

Section 1 


Accordingly, consider the following situation. Let A = (a:) denote the set of 
all real numbers; and U denote an unknown frequency-distribution law of draft 
from {a:} such that there exists an unknown function, f(x), bounded and not 
negative in A, and that the probability of obtaining x in an arbitrary interval 
(a, |3) is 




‘Schmidt, K., Annals of Math, Slat,, 5, 30, (1934). 
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and, for every positive p < 1, there exists a finite interval (a, 0) such that 
P{a < on > p. Let U be called an infinite population; and let n drafts, 
independently thus governed, made from A without replacements be called a 
random sample of n observations from U. Let S = {xk},k = 1, ■ ,n, denote 

such a sample; the enumeration to be made in an arbitrarily determined manner. 
In any case x, 5 ^ for i 7 ^ j. 

Temporarily, let us consider k to indicate the order of draft of the values of 
{xk}, and let p* = P{x < Xk) denote the probability that x, drawn at random 
from U, be less than Zk of S. The probability d priori (i.e., without regard to 
relative values of z in the sample) that in such random sampling ps lie between 
p' and p", where 0 g p' < p" g 1, is obviously independent of k, and 
equals p" — p'; i.e., pk is equally likely d priori to lie in either of any two equal 
intervals in its possible range, (0, 1). Furthermore, the probability that in 
the rest of the sample, S, there will be just r values less than Xk is 

(” 7 ^) • P* • (1 - , 

where r is an integer and 0 ^ r < n. Of course, p* is unknown; but we may 
calculate (for all cases in repeated sampling wherein the same value of r is 
encountered) the expectation, Tr ip' < Pk < p"), that pk lie in the interval 
ip'iP")- This is given by 

(2) Pr(p' <pk< v") = p’--q>-dp, 

where s = w — 1 — r, and q== 1 — p. This is a familiar result*- *■ ^ in applications 
of the well-known principle of Bayes to estimation of d posteriori probability. 
The approach is convenient in that many relations which have been developed 
in this connection are made immediately available. However, that pk is 
equally likely d priori to lie in either of any two equal intervals in its possible 
range, is not based in the present case upon an especially added assumption 
nor any plea concerning equal distribution of ignorance, but follows directly from 
the elementary assumptions of random' sampling. Accordingly, we are enabled 
to develop for given ranges what may be called the specific confidence or mathe- 
matical expectation that a given variable lie therein. 

Obviously, (2) does not depend on fc if this index is the order of draft provided 
that just r values of the sample, S, are les^ than the one under consideration, Xk. 
To simplify notation, accordingly, let the index k for any given sample, {x*}, 


* Bayes, Philosophical Transactions, SS, 870 (1763). Cf. Todhunter, I., “k. History of the 
Mathematical Theory of Probability,” Macmillan and Co., London, 1866. 

’ Laplace, “ThSorie Analytique des Probabilitfis,” Paris, 1820; and other works, Cf. 
Todhunter, l.c. 

* Pearson, K., Philosophical Magazine, Series 6, Vol. IS, 365, (1907). 
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be deternoined by the relations, Xi < a:,- for i < j, where k = 1, • - n. Then, 
by (2) as A; = r 4- 1, we have 

(3) P(p' < p. < p'O - f’ p" -r-'-ip, 

where pk is the probability that random sample values from U will be less thaii 
the fc-th value in order of ascending magnitude from a given random sample, 
[xh], of n values from U) and P(p' < pk < p") denotes the expectation that in 
such sampling pk wiU lie in the interval, (p', p"). 

In general, let E{w) ^ w denote the mathematical expectation of any 
variable, w, under the given sampling conditions. Then, from a well-known 
relation developed by Laplace, we obtain from (3) the mean expectation of pk, 


(4) 


Pk = 


n+l’ 


and, further relations'* of Karl Pearson yield 
(5) E((p, 




i.e., the mean squared error in systematic use of 


(n-f l)2-(n + 2) ' 
k 


n -f 1 


instead of the unknown 


Pk should have the value in (5). Specific confidence ranges for a: are readily 
established; e.g., the expectation that in random draft from U we obtain x 
within the range {Xk, x„-k+i) in view of the sample, S, is 


( 6 ) 


Tixk <x< x,^k+\) = ” it - for 21b < n -f 1 ; 

■re -p 1 


— — k 

and P(x < Xi) = P{x > x„-k+i) = — r-r. For a given variate, w, the range 

re -b 1 

(a, /3) will be called central iiT{w < a) = P(w > 0), as in the case under (6). 
This is in accord with the development of the subject of confidence ranges by 
Neyman®'® and by Clopper and E. S. Pearson’ following the introduction of the 
notion of fiducial interval by R. A. Fisher.®’® The estimates of p* in (4) may 
be of value in studying frequency-distribution from the point of view developed 

• / k \ ' /2k — l\ 

by Schmidt,' by comparison of Xk with M .) rather than 41 — rr ) where 

\re -I- 1/ \ 2n / 

V' is a univariant inverse of the integral of a given frequency function, taken to 


' Neyman, J,, J. Boy. Stat. Soc., 97, 689, (1934). 

* Neyman, J., Annals of Math. Stat., 6, No. 3, 111, (1935). 

I Clopper, C. J., and Pearson, E. S., Biometriha, SB, 404, (1934). 

* Fisher, E,. A., Proc Canib. Phil. Soc,, 26, 528, (1930). 

' Fisher, R. A., Proc. Roy. Soc., A 139, 343, (1933). 
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replace the unknown /(a;). Obviously, P{xi, < x < x/,+i) = A discus- 

n + 1 

sion o£ the special case, n = 2, has been prominent recently in a controversy 
between Jeffrey^® and Fisher®' and in an article by Bartlett. 

Now, in (3) for p = p', and p" = 1; we may write^® 


(7) P(p < Pk) 



— Ig(Tl — ^ “h l^ 1^) 


Bgjn — h + l,k) 

Biin — k -{■ I, k) ’ 


where ? = 1 — P, and the incomplete B and I functions are those of K. Pearson*® 
and Muller.*^ Now, let M be the unknown median of the infinite population, 
U. Then, by definition of p*, if and only if Xk > M, then p*, > Therefore, 

(8) P{M < X,) = P(0.5 < pd = Q"- 2 (”) ^ -k + l,k). 

, and the expectation that M lie 
between the fc-th observations from each end of the set, S, is given by 

(9) P{xic < M < a:n_)i+i) = 1 — 2-Ui{n — k + 1,-k), for 2fc < n + 1. 
Obviously, this confidence range is centred. 


Obviously, P(xk < M < xji+i) 




Section 2 

Now, consider another infinite population, U'. In similar manner we may 
develop expressions for confidence ranges and distribution expectations. Let x' 
be the variate, and consider a sample. S' = {x^}, of n' observations drawn 
ivithout replacements from A according to U' but after the sample, iS, of U] 
i.e., so that no two of these sample values in S' are equal, nor any of them equal 
to a value in S. Furthermore, let m be the order of ascending magnitude of x' 
values in S'-, and p^ = P^x' < x^) for x' drawn at random from U', and let M' 
be the unknown median of U'. Then, by replacement of x, n, pk, k, and M hy 
x', n', Pn, m, and M', respectively, in relations already developed for U and 8, 
we obtain corresponding expressions for U' and S'] e.g., 

(10) P(a:^ <x' < xln) = i . 


'“Jeffreys, H., Proc. Roy. Soc., A 138, 48, (1932); A HO, 523, (1933); A W, 9, (1934); 
Proc. Camh. Phil Soc., S9, 83, (1933). 

“ Fisher, R. A , Proc. Roy. Soc., A U8, 1, (1934). 

“ Bartlett, M S., Proc. Roy Soc., A HI, 518, (1933). 

Pearson, K., Biometrika, 16, 202, (1924). 

" Mliller, J. H , Biometrika, £3, 284, (1930-31). 
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Now, let the index values, fcm, be defined as the number of values of {a:*} that 
are less than ■■■ , n\ Then, for all realized cases, 

(11) <xL< , m = 1, ■ . ■ , n', 

for the extreme members of (11) in S. Then, for x and x' drawn at random 
from U and U', respectively, we may write 

(12) 0 < (n + l)(n' + l)-P{x < x') — km < n n' 1, 

m«= 1 

provided that the expectations for U and JJ' may be treated as independent. 
Similarly, for P{M < M') we have the relations, 

S - km + I, km) < 2^'z -PiM <M') <1 

(13) ^ 

~ + !)■ 

«»= 1 ' ' 

Of course, lo i(n + 1, 0) s 0, and /o.6(0, w + 1) = 1. It may be verified readily 
that the inequality relations of (12) and (13) provide best upper and lower 
bounds for F(x < x') and P{M < M') under the circumstances given. 

Obviously, any increasing function, ^{y), for y in A, may be used throughout 
the arguments, with 0(t/) replacing y — x, Xk, M, x', xl, M', respectively. 


Section 3 


Consider, now, the case of a finite population, Uk, of real numbers 
a:W < for f < j, i = 1, • • • , iV. Assume that N is known, and that a 
sample, S, of n values has been drawn at random from Uk without replacements. 
Let the sample values be {xt) , fc = 1, • • • , n; and k be an arbitrarily determined 
index. As before, we might consider k the order of draft, temporarily, but .the 
same analysis may be made if we let k be the order of ascending magnitude in 
the sample, S, and disregard its value in connection with d prion estimates of 
draft probability. Each Xk = for some unknown Uk = 1, N; and, 
d priori (i.e., with no knowledge as to order of magnitude of other values in the 
sample), any two of these values are equally likely. Obviously, this is so if Xk 
is the first value drawn from f/y, and the rest of the sample may be regarded 
as a random draft without replacements of n — 1 elements from [Uk — a;*].- 
Let>.r be the number of these sample values less than xu, and s = n — 1 — r. 
Then the probability of drawing such a sample after the given Xh, under the 


conditions given, is 


TrT-.l 


A 

/N-l\ 


, where ui, ■ 


1 is the unknown number of 
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values in that are less than Xk. To estimate the expectation, = mj, — 1), 
that there are just a given number, R, of values in f/w less than Xk; we encounter 
the same situation considered by K. Pearson in a paper^^ subsequent to those 
applied to the infinite universe; and, by a simple conversion in notation, we have 

(14) P(B - u. - 1) - 


In previous communications^®-” I have defined a function, 



(15) 


\p(r, s, r', s') s 


r* 

?( r -f- + 1 + « 

/r + s + r' + s' + 2\ 

\ r+s+1 } 


for any four rational integers r, s, r', s' ^ 0 ; and shown that Pearsons further 
result, equivalent here to evaluation of P{uk g E + 1) for a given R, may be 
expressed by means of this \t--function. Thus, we have 

(16) Kuk ^R + i) = ^{r,s,B~r,N -R-s~2). 


It was demonstrated also'®-” that 


(17) i/'(r, s, r', s') = 4>(r, r', s, s') s ^(s', r', s, r) s 1 _ ^{s, r, s', r') 

with extension of the definition to include ^(r, s, — 1, s') = 0, and that 

A+ ?■'+ lYA + 1 

^ \r+l4-a/\ s — a 
-f- 5 q- r' “h "h 2 

r + s + 1 

As in the case of the infinite population, here also it is obvious that the order 
of draft of Xfc is of no consequence in the analysis ; and again we will let fc = r + 1, 
whence s = n — k, and we may make these substitutions in (14) and (16). 
Then, we may write 

(19) P(v.k ^ R) — ^p{k — 1, n — k, R — k, k + N — R — n ~ 1); 




“ Pearson, K , Bxometrika, SO A, 149, (1928) . 

Thompson, W. R., Biometriha, SB, 285, (1933). 

” Thompson, W R , American Journal of Mathematics, 67, 450, (1935). 
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and, obviously, P(m„_ 4 +i ^ iV - 22 + 1) = P(uk ^ 22). Hence, if we let M 

N ~ a 

be the unknown median of Uir; and m = — - — , where a = 0, 1, and - a is 
even; then, as «< is an integer, 

P(Xk g M g Xn-h+l) = 'p(uk ^ Rn-Hl) 

( 20 ) \ 2 / 

si- 2*^(ib - l,n-- k,in - k,k-\- N - m ~ n ~ 1) , 

which is the expectation that the median oi Un lie within the closed interval, 
{xk, Xn-kn), for 2fc g n 4- 1. This gives the confidence range, analogous to 
that for the infinite universe. It may be noted that 


^ 22 < Uk^i) = P{uk g 22) - P{uk+i S 22) 

= ^(r, s, r', s') - + 1, s ~ 1, r’ - 1, s' + 1) 


where r - k - I, s = n - k, r' E - k, md s' = k N - E -- n - 1. 
Hence, (18) gives 


( 21 ) 


P(uk ^ 22 < Mjfe+i) = 



The approach by way of Pearson’s problem again makes it easy to evaluate 
the expected mean pk and variance as in the case of the infinite population, 

y^u — 1 

where pk = P{x< Xk) for x drawn at random from U n. Of course, pk- 


but Uk is unknown. From Pearson’s result,?' however, we obtain 


N 


( 22 ) Pk 

and the expected variance of pk, 


k{N 4- 1) - n - 1 _ k 
N{n 4' 1) n 4- 1 


N) 


ri- 


fe- 1 
N 


(23) 


= Eiivk ~ pk)^) = 


fe(n - fe 4- 1)(2V 4- 1)(JV' - n) 
(n4- l)2.(n4-2)-F 


Yale University. 



THE SAMPLING DISTRIBUTION OF THE COEFFICIENT OF 

VARIATION 

By Walter A. Hendricks with the assistance of Kate W. Robey 
National Agricultural Research Center, Beltsville, Maryland 

The coefficient of variation does not appear to be df very great interest to 
statisticians in general However, its use in biometry is sufficiently extensive 
for some knowledge of its sampling distribution to be desirable. The present 
paper is an attempt to satisfy this need. 

For the purposes of the following discussion, the coefficient of variation may be 
defined as the ratio of the standard deviation of a number of measurements to 
the arithmetic mean: 



( 1 ) 


As is well known, the probability that the mean of a sample of n measurements, 
taken at random from a normal universe, lies between x and x + dS and that the 
standard deviation of the measurements in the same sample lies between s and 
s + ds is given by the relation: 


dF: 




i , 8 


2in-l r 


n - 1 


e s’' ^dxds. 


.( 2 ) 


If equation (2) is expressed in terms of polar coordinates by means of the 
transformation: x = p cos 0; s = p sin S, it becomes a distribution function of p 
and 6 in which 6 = arc tan v : 


dP 


/) .9 “ 




24n-l p 


n - 1 


— — ^ p cos fi + m*) 

e 


pn-l gjjin-2 g (3) 


In equation (3) , p may vary from 0 to « and 6 may vary from 0 to ir. To find 
the distribution function of 9, all that is necessary is to write: 


dPi = Ic 



po-i dp 


dd 


( 4 ) 
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in which, 


ni» 


2J"-i iri <T” 


— m* Bm’j . „ 

e sin"-2 0, 


a — — , and b = w cos 

2‘ ff’ 2‘ (T ' 


and to perform the indicated integration. 

To evaluate the integral inside the brackets in equation (4), we may write: 

r dp = -1 r e-“’ y bi du.... (5) 

jo a"j_6 

Consider the integral, J 6““’ du. If b is sufficiently large, as is the 

case when the parameters of equation (2) are of such magnitude that practically 
the entire volume under the frequency surface lies to the right of the s axis, that 
is to say, if negative and small positive values of x occur so infrequently that their 
effects may be neglected, the lower limit, —b, of this integral may be replaced 
by _ CO without introducing any appreciable error. The value of the integral, 

j e““’ u*-^-^du, is zero when n — 1 — i is odd and when n — I — i 

is even, zero being counted as an even number. 

Subject to the above condition that h be sufficiently large, we may, therefore, 
write equation (5) in the form: 



g~(ap-6)’ pW-l 


^ a” ^ (re — 1 — z)! i! 



( 6 ) 


in which the symbol, indicates that the only terms entering into the summa- 
tion are those in which n — 1 — z is an even number. 

Substituting this expression for the integral inside the brackets in equation 
(4), replacing h, a, and b by the quantities which they represent, and writing V 

in place of the ratio, — , we obtain the following distribution function of 0: 


dF, = 


*' K V) 


--2-sin=9 . „ 

e sin"-^0 2j 

i=0 


(re 






(re-l-z)!z! 


cos' 6 dd. (7) 


Equation (7) may be written in terms of v, if desired, by making the substitu- 
tion, $ = arc tan a; 
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dF„ 



n 


yn-2 

{f+tP)U 


Tl-1 

S' 


t-O 


(n— 1 — ^)!^! 2** 


dv. . (8) 


It must be emphasized that equation (8) has been derived on the hypothesis 
that negative and small positive values of z occur so infrequently that they may 
be neglected. However, since this condition is satisfied in the vast majority 
of practical problems in which the coefficient of variation is likely to be used, 
the limitation is not of much practical importance. 



00 01 02 0,3 04 05 06 07 08 09 

VALUE Of V 

Fig. 1. Observed and Theoretical Distributions of Values of v for 512 Samples 
OF Numbers of Heads Appearing in Two Successive Tosses of Ten Coins 


As a test of the validity of equation (8), the authors calculated 512 coefficients 
of variatibn of. the numbers of heads appearing in two successive tosses of ten 
coins. The coins were tossed 1024 times, thus yielding 512 samples, each con* 
sisting of two observations. For these data we have m = 5, <r = 1.581, and 
y = 0.3162. 

For the case, n = 2, equation (8) reduces to: 


dF, 



i 

K’ 1 + ®’ 


dv 

(1 + 


(9) 


Figure 1 shows the distribution of the 512 values of v obtained from the coin 
tossing experiment, together with the theoretical distribution given by equation 
(9). 

An inspection of Figure 1 indicates that the agreement between the observed 
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and theoretical frequencies is fairly good. An application of the familiar chi 
test for goodness of fit showed the agreement to be rather poor. According to 
this test, the degree of discrepancy between theory and observation could have 
arisen by chance less than once in a hundred trials. However, the discrepancies 
may be partly due to the fact that data distributed in a discrete fashion were 
treated by methods appropriate to the analysis of data distributed according 
to a continuous frequency curve. 

As another test of the validity of equation (8), the authors calculated 149 
coefficients of variation of “days to maturity,” which is the length of time elaps- 
ing between the date of hatch of a chicken and the time egg production com- 



VALUE OF V 

PiQ, 2. Observed and Theorbticai, Distributions or Values or v for 149 Samples 
or “Days to Maturity” in Rhode Island Red Pullets for Samples of Two 

Observations 

mences, for samples of two observations made upon Ehode Island Red pullets. 
Figure 2 shows the observed distribution of the 149 coefficients of variation, 
together with the theoretical distribution given by equation (9) . 

In applying equation (9) to these data, the parameter, V, had to be evaluated 
from the data. The best estimates of the values of m, cr, and V which could be 
obtained from the 298 measurements of “days to maturity” are m ~ 210.477, 
a = 18.6991, F = 0.0888415. The theoretical distribution shown in Figure 2 
is based on this value of F. 

The agreement between theory and observation shown by Figure 2 is very 
good. In this .case, the chi test showed that the degree of discrepancy en- 
countered could have arisen by chance about six times in ten trials. 



SOME NOTES ON EXPONENTIAL ANALYSIS 
By H. R. Grummann 

Assistant Professor, Department of Applied Mathematics, Washington University 


M. E. J. Geuhry de Bray in his charming little book “Exponentials made 
Easy”' tells how to determine the constants in the equation, 

(I) y = + Ace*** 

so that the curve will pass through four points, with equidistant ordinates on 
an empirical curve. If (Fig. 1) yniVuVit 2/3 are the equidistant ordinates 
and 5 is their common separation, yo being the y intercept of the curve, de 
Bray's formulas are; 

/TT\ ^ ^ log Zj 

(II) ^ ~ ~J~ 


where «i and Zj are the roots of the quadratic equation 

2 * z 1 


(III) 


= 0 . 


ys 2/2 2/3 

I 

Vi yi Vo I 

The coefficients Ai and Aj of the two exponential terms are obtained by solving 
the two simultaneous equations 


(IV) 


Ai -h Aj = j/o 

AiZi -f" AtZi = 1/1 


In attempting to find suitable empirical equations for some “river rating 
curves” — graphs of discharge versus stage — the writer tried to make use of 
de Bray’s procedure. The original intention was to use the above method to 
determine the constants, and then to correct these constants by the use of 
Least Squares, as done by J. W. T. Walsh* in an application of the method to 
a problem in radioactivity. It often happens that a series of plotted obser- 
xations suggest a simple exponential function, but that when the observations 
are replotted on semi-logarithmic paper a straight line is not obtained. Often, 
as in the case of a good many river rating curves, the result may be described 


3 Macmillan & Co. Ltd., St Martin’s St., London W. C. 2 , 

• Proceedings Phys. Soo London XXXII. This reference is given by de Bray in his 
book, “Exponentials made Eafy.” 
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as “almost straight.” At first blush it might seem that in all such oases it 
ought to be possible to fit a curve with equation I to the data by de Bray’s 
Method. By an easy generalization of the above formulas, the constants in 
an equation with three or four exponential terms could be determined if two 
terms were not enough to secure a good fit. 

It was soon found, however, that innocent looking monotonic curves without 
points of inflection plotted from data that gave an “almost straight” line on 
semi-logarithmic paper quite often led to a quadratic equation, (equation III) 
whose roots were not both positive numbers. 


Y 



If Zi and Zj , the roots of III, are complex conjugates, it may be seen from IV 
that Ai and As will be complex conjugates. Also, oi and oj will be conjugate 
complex numbers and may be calculated as follows : 

Let Zi = and Zj = re~‘® 
then from equation II, 

whence, by division to eliminate r we have 


(Va) 


2ie 

-y = Oi — 02 . 
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Also, by multiplication to eliminate 6, 


(Vb) 


y»2 — Qj* 


2 log r 
6 


=. fli + 


02. 


The sum and difference of the two o’s being obtained by these expressions, 
one may solve for aj and Oj. 

Let Oi = X -|- i/i A.I = ft -|- 1/3 

<h = X — tn As = oc ~ 1/3 

Then equation I becomes 

y = (a + i)3)«f’'+‘'‘’® + (ft — 

y = cos liX — ^ sin ytx], or 

(VI) y — 2e^‘’R cos (nx + c) 

Q 

where R = s/cA 4- ^ and tan c — 

ft 


If one of the roots of III is negative, the de Bray formulas II and IV •will 
still give an expression for equation I which formally reproduces ya, yi,yi, and 
ys when 0, S, 25, and 35, are substituted for x respectively, but which is useless 
for interpolating and of no value as a solution of the curve fitting problem. 
Suppose, for example, that Zi is positive and Zs is negative. Then 

zs = (—1) 1 Zi 1 and 

log Zs = log (- 1) + log I Z 2 |. 

Equation I then becomes 

* X log 1 1,1 

j/ = Aie“‘* 4 -(-l)*A 26 ' , 

X 

the factor (—1)* being real only when x is an integral multiple of 5. If the 
(— 1) is written e'^, we have 

T % X X l<^ I St I 

y = ’ Ate ' , or 

^lo^r , ^ 3.-1 

y = Alt' + Aae * cos — + i sm — . 

Neither the real nor the imaginary part would be a graduation function for a 
monotonic curve as each has a half period of 5. 

The expression for I is similar, and of no greater practical value, if both of 
the roots of III are negative. 
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Without loss of generality we may let i/o = 1, ri = ^ rj = - ra 

c/0 c/1 

the quadratic III becomes 


2 * 

Z 

1 

TlH 


1 

TiTi 

n 

1 


or, 


Then 

Vi 


written in the form 


2 * + -f (? = 0, i.e., 


(Ilia) 


2^ + 


nin - n) , nr^Cra - n) 
{n - n) ^ (r* - ri) 


= 0. 


Hence the roots of this quadratic are real and unequal if H > 6, equal if H = 6, 
and complex if Z) < 6, where 




l + 4rr? + ^' 

b-i 

J'aj 

1 L?"* ’'sj 


Trom the point of view of the computer, however, it is about as much work 
to calculate D as to solve the quadratic equation. 


P 



Reverting to equation Ilia; suppose the numbers $ and p are plotted as the 
coordinates of a point (g, p) as in Fig. 2. Then the parabola p“ = 4g is, so to 
speak, a locus of equal roots. The remainder of the figure requires no expla- 
nation. 

Suppose that all the r’s are positive, as they would be in the case of a simple 
monotonic curve which one proposed subjecting to an exponential analysis. 
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If g < 0, the quadratic will have one negative root. Now 

-n) , , 

q = ^ and hence 

in - ri) 

for g < 0, if r 2 > ri , then rs < n and consequently r%<n> n and if n <rt, 
then ra > 1 * 2 , or ri > )'2 < rs . Also, provided > 4g, a positive p and a pos- 
itive g will give two negative roots. But 


nin ~ n) 

^ in - rO ' 


and p and g can not both be positive when all the r’s are positive as this implies 
either that n> n, J'l > and rs > n, a, contradiction, or else that rs < ri, 
Ti < rs and rs < rs, also a contradiction. Hence if both roots are negative, the 
r’s can not be all positive. The case of two negative roots will not arise in 
trying to fit equation I to a monotonic curve, since if all the r’s are positive 
both p and g can not be positive. 

For all r’s positive, provided p^ > 4g, a positive g and a negative p will give 
two positive roots. But 


and 


_ n r; h -^ ) 

(r2 - ri) 


_ rsirs - ri) 
^ irs - r,) 


> 0 


means that rs > r^ > ri or rs < r^ < ri. 

To sum up: If all the r’s are positive, de Bray’s method of exponential 
analysis is possible (a) when D < 6 and the roots of III are complex; (b) when 
D > 6 and n > r 2 > rs or when ri < rs < rs. 

Figure 3 gives a picture of the second condition (b) of the preceding para- 
graph. Suppose an exponential curve is passed through the first two points 
on the empirical curve with ordinates j/o and yi . Its equation will be: 


S X 



Suppose also that ys is less than the ordinate to this curve when x = 2S. Now 
pass an exponential curve through yi and ys using a new axis of ordinates 
coinciding with yi . Its equation is 


X X 
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or referred to the original axis: 

y = Vin * . 

Now if the graduation is possible without using trigonometric functions, yt 
must be less than the ordinate of this second curve when x = 35. 



Y 



It is natural to inquire if the state of affairs is not similar to this, for the 
cases of fitting curves with equations similar to I but having three or four ex- 
ponential terms on the right hand side instead of only two. If three terms are 
used (see Fig. 4) to find constants in 


(la) 


atx 


y = Aie*'* + A 2 e‘‘“ + Ase' 
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it- is first necessary to find the roots of the cubic 


(Ilia) 


2® 3* 2 1 


fix) = 


Vt Vi Vz Vi 

Vi Vz 2/s Vi 


2/s Vi 2/i 2/o 


Now, fix) will have no negative roots if /(—a;) has no changes of sign. But 
writing the conditions that the cofactors of the elements of the first row in 
the above determinant have the same signs, and assuming that all the j/’s are 
positive, one does not get a series of conditions analogous to > ri or 

n <ri < ri. 

In the following, formulas will be derived for finding the constants in equation 
la after the roots of Ilia have been determined. Also formulas will be obtained 
for finding the constants in 

(Ib) y = Ai e"** + As + Ai e*** 


after the roots of 

(Illb) 


2^ 


2* 

2 

1 

y? 

ye 

ye 

ya 

ya 

y« 

y* 

y* 

ya 

ys 

Vi 

ye 

ya 

Vi 

yi 

Vi 

ya 

y* 

yi 

yo 


have been found. Both sets of formulas have been tested by an “exponential 
analysis” of the same body of data, viz., the very accurate recent detemiiina- 
tions by the U. S. Bureau of Standards of the saturation pressure of water 
vapor above lOOC.^ 

For the case of three exponential terms in the graduation function, the o’s 
are found by formulas like II or V, after the roots of the cubic are found. If 
2i, zs, Zs are the roots, the A’s are obtained by solving the simultaneous equa- 
tions 


Ai -|- Ai -j- As = yo 


(IVa) AiZi -f- ^222 -f- AaZj = 2/i 

AiZi^ "f" A^z^ -j- AsZs — Vi 


“ Osborne, Stimson, Eiock, and Ginnings>: The Pressure of Saturated Water Vapor in 
the Range 100° to 374°C. Bureau Standards Journal of Research, Vol. 10, Febr. 1933, 
page 178, 
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This presents no new difficulty unless two of the roots are conjugate complex 
numbers. In this event, if we let 2i = the real positive root, 23 = r «•*, and 
’ 2a = r the determinant D of the equations IVa may be written 


D 






or, expanded in terms of the elements of the first column and their minors, 
D == 2t[zir^ sin 25 — (r^ + 2jr)sin 5 ], 
a pure imaginary. Similarly, 

AiD = 2i[r^yi sin 20 — (yor® + j/2r)sin 0], 


also a pure imaginary, so that Ax is real. Having calculated .Ai , it is substituted 
in the first two of equations IVa, which are then solved for Aj and As. <h 
and fla are then determined by formulas Va and Vb, replacing the subscripts 1 
and 2 in those formulas, by the subscripts 2 and 3 respectively. Finally the 
two exponential terms corresponding to the complex roots of the cubic are 
combined into a single trigonometric term as in equation VI. 

The necessary formulas for the case of four exponential terms in the gradu- 
ation function will be discussed briefly. The equations 


At -|- Aa Aj -h A4 = j/o 

Ai2i A52J -j- Ag28 -f- A 424 = yi 

(IVb) 

Ai2i -f- Ai!i\ A325 d" A424 = 1/2 

AiZi Aj 22 -j- "b A424 = ys 

have to be solved for the A’s. The z’s are the roots of Illb. Two cases will 
be considered: First case: 21 and 22 are complex conjugates and zs and 24 are 
complex conjugates. Second case: 2i and 22 are complex conjugates and 23 
and 24 are real and positive. In either event Ai and A2 are complex conju- 
gates, as will be proved below. Formulas for Ai are given for both cases. 
Then A2 is known since it is the conjugate of Ai . Having found Ai and A2, 
let 


Co — 2/0 — (Ai -)■ A2) 

Cl = yi - (Ai 2 i -f A222) 

Both Co and Ci are then real. To get Aj and A4 solve the equations : 

A3 A4 != Co 
A 323 -|- A424 = Cl 
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A pair of exponential terms with conjugate complex coefficients will then be 
expressed as a single real trigonometric term as in VI. 

The determinant of equations IVb may be written 


(VII) D = (Zi - Zt)iZi - Zs)(Zl - Z4)(Z2 - Zs)(Zs - Zi){Z3 - Zi). 


First case: Let Zi = a zj = a — th, Za = a + t/3, Zi = a — i/3. Then D 
may be written 

(Vila) D = -m(a - ay + (b- m ((« - «)= + (6 + 
which is real. Now 


AiD A" AiD = 


= (Zl - Zi) 


2/0 

1 

1 

1 


1 

2/0 

1 

1 

2/1 

22 

28 

24 


Zl 

2/1 

28 

Z4 




4 - 


zf 

zl 

2/2' 

2* 

2? 

z\ 


2? 

2/2 


3 

3 



3 

y» 

a 

3 

2/8 

22 

23 

Z4 


2? 

Za 

24 



0 


2/0 

1 

1 




1 


2/1 

28 

24 



(21 4- 22) 

2/2 

Za 

2*4 


(2 

1 4" 2iZ2 4" 2*) 

2/8 

2? 

2® 



and this is real since (zi — Za) is a purd imaginary and the minors of the real 
elements of the first column of the determinant are all pure imaginaries. Hence 
Ai and At are complex conjugates since when each is expressed as a quotient 
of two determinants by Cramer’s rule, the sum of the two numerators is real 
and the common denominator is also real. 

' For purposes of numerical calculation Ai may be obtained from 


Ai = 


D 


in which D is obtained from Vila, 

V = 2/3 - (Z2 4- 23 + 24)2/2 + (2228 + ZtZi + 2324)2/1 - (222324)2/0, 

and P = {zt — Z 3 )(Z 2 — 24 ) (za — Z 4 ) 

= 2/3[(a! — a)2b + i{(a — a)* -j- (jS* _ 6*) j], a complex number. 

If ZiZt = r* and z^Zi = pS the symmetric functions of the z’s in the above formula 
may be calculated from 

Z 2 Z 324 =: (a — ib)p® 

ZaZs + 22Z4 + ZaZ4 = p** 4- 2a{a — ib) 

Z3 4- Za 4 - 24 = (a — lb) 4 " 
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■For the second case, which is exemplified by the vapor pressure data, 

(Vllb) D - 2tb[(a - 23)= + [(a - 24)“ + b^] [23 - 24], 

a pure imaginary. The sum of the two numerators of .4,1 and At, namely 

yo 1 1 

2/1 2s Si 
Vi si z\ 

2/s 2I si 

is a pure imaginary, since (21 — 22) has this character, and the determinant 
has nothing but real elements. Hence Ai and 42 are still complex conjugate^ 
when 2® and 24 are real, 2i and 2a being complex conjugates. 

' For purposes of numerical calculation 4i may be obtained from 

N 

^ “ (2i - 22) (21 - 23) (2i - 24)' 

Here (21 -■ 22) is a pure imaginary and the other three factors are complex. 

Let N = ri(co8 i sin di) 

2i — 23 = raCcos ^2 4- ‘ siu 5a) 

2i — 24 = rsCcos 5s + t sin 5s) 

Then 

^ _ n [cos (5i — 52 — 63) -f t sin (5i — 52 — 53)] 

' ^ {si- Si) ra u 

In calculating N by the formula given for it in the preceding paragraph, the 
symmetric functions of the 2’s were obtained from 


(21 - Si ) 


0 

1 

2i + 22 

si + siSi + zi 


222324 == (a — ^b)z^Zi 

2223 + 2224 + ZsSi == (o - tb )(23 + Zi) + Z^Zi 
Si Zi Zi =1 (a — lb) -t" '*3 -b *'4" 

Example 

The first two of the following tables are abstracted from Table 2, p. 178 of 
Bureau Standards Research Paper No. 523. The third table is abstracted from 
Table 3, p. 179 et. seq. of that publication, x is the number of degrees centi- 
grade above 100°. y is the pressure of saturated ■ssi'ater vapor in International 
Standard Atmospheres. In the first two of the following tables, the values 
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of y are observed values. In the third, they are interpolated or graduated 
values calculated at the Bureau of Standards. 


TABLE I 


x 

y 

0 

1.0000 


12.3887 

180 

63.3558 

270 

207.771 


TABLE II 


iS 

y 

0 

1.0000 

50 

4.6969 

100 

16.3472 

150 

39.2666 


84.7969 

250 

163.206 


TABLE III 


X 

4 

y 

0 

1.0000 

39 

3.4666 

78 

9.4490 

117 

21.612 

156 

43.392 

195 

78.974 

234 

133.64 

273 

215.37 


The observed values of y in Table I are reproduced by the following formula 
used in conjunction with a standard six place table of logarithms and trigono- 
metric functions: 

(I) y = 3.967433 « cos {.4086768a! - 76°24'03".7). 

The observed values of y in Table II are reproduced by the following formula 
used in conjunction with a standard six place table of logarithms and trigono- 
metric functions. 


(ID 


y = 3.0263744 

+ 2.2171657 eoiiwwne, cos (155°59'35".5 - 0.7899232a;). 














144 


H. B. GBtJMMANN 


Hence the formula is presumably an excellent one for interpolation between 
the values of y listed in Table II, if the greatest accuracy is not needed.'* 

The values of y in Table III are reproduced exactly to five significant figures 
by the following formula used in conjunction with a standard six place table 
of logarithms and trigonometric functions. 

y = 3.8902543 - .164787 

+ 2,743000 cQg (.7860726a: + 186‘’28'53".2). 

By means of this formula the saturation pressure of water vapor was calculated 
for every five degrees from 100°C to 370°C in order to make comparisons with 
the corresponding “smoothed" values in Table 2 of the Bureau of Standards 
publication referred to above. The discrepancies were never more than one in 
the fourth significant figure and generally less. The poorest agreement was 
in the ranges of temperature from 100°C to 135®C and from 245°C to 270'’C. 

It is a pleasure to acknowledge the intelligent and painstaking assistance of 
Mr. G. D. Lambert, undergraduate student at Washington University, for doing 
most of the computing. 

Washington University, 

8t, Louis, Mo. 


* The values of y in Table III (not counting the value of y for x = 0) are reproduced 
by it with an average error of .13% and a largest error (for x = 234°) of .30%. Four of 
the errors are negative and three positive. 



ON THE FREQUENCY DISTRIBUTION OF CERTAIN RATIOS 

Bt H. L. Rietz 
U niversity of Iowa 

Considerable interest in the distribution of ratios, t = y/x, has no doubt 
been suggested by important applications. For example, we may mention the 
opsonic index in bacteriology, the ratio of systolic to diastolic blood pressure 
in physiology, and ratios such as link relatives or certain index numbers in 
economics. 

In 1910, Karl Pearson^ gave certain properties of the distribution of ratios 
by means of approximate formulas for moments up to order four in terms of 
means, variances, product moments, and coefficients of variability of x and y. 
The resulting formulas did not give, with sufficient accuracy,, the constants of 
the distribution of the opsonic index for the purpose of Dr. Greenwood to whom 
Pearson attributed the derivation of the formulas for the special case in which 
X and y are uncorrelated. Pearson next adopted the plan of tabulating the 

reciprocals, say x' = -, and then finding the constants of the distribution of 

X 

the product yx' in the case in which x' and y are uncorrelated. He then ob- 
tained satisfactory results in illustrative examples. 

In 1929, C, C. Craig* obtained the semi-invariants of y/x in terms of moments 
of X and y, and then expressed the moments in terms of the semi-invariants of 
the distribution function, fix, y), of x and y. By this means, he was able to 
deal with the case in which x and y are normally correlated under suitable 
conditions. Craig found it desirable to restrict the distribution of a: in such a 
way that the probability of a zero value of a: is an infinitesimal of sufficiently 
high order that a certain integral exists. This limitation seems to imply in 
applications to actual data that no zero values of x are to occur. This suggests 
that we deal with the cases of x at or near zero with considerable care. 

By starting with the assumption that the values of x and y are a set of 
normally distributed pairs of values with correlation coefficient r, and by con- 
sidering the quotient z = — a and b being Constants, R. C. Geary,’ in a 

paper published in 1930, found an algebraic function, u => /(z), of fairly simple 
form with the property that u is nearly normally distributed with arithmetic 
mean zero and standard deviation unity provided that a + a; is unlikely to 

' On the constants of index distributions, Biometrika, Vol. 7 (1910), pp. 531-546. 

* The frequency function of y/x, Annals of Mathematics, Vol. 30 (1928-29), pp. 471-486. 

’ The frequency distribution of the quotient of two normal variates, J. Royal Statistical 
Societv. Vol XCIII (1930), pp. 442-7. 
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have negative values. Here we have again a suggestion to exercise special 
care in the case of quotients with the divisor near zero or negative. 

In 1932, Fieller^ obtained in explicit form the approximate distribution of 
t = y/x where values (a;, y) are drawn from the bivariate normal distribution 

1 _i 1 / (»— 'sy jy-y)' ^ 

g 2 l-r*\ Ox* / 

2 ^ 0-1 (Ty 'sj \ — r‘‘ 

under the condition that S, is large compared with tr* . 


y 



Very recently Kullback® found the distribution law of the quotient, t = yjx, 
where x and y are drawn from Pearson Type III parent populations given by 

/i(^) = Siiy) = oga:goo, ogj/S«>. 

It is fairly easy to see, in a general way, that the distribution of i yfx 
depends very much on the location of the origin as well as on the parent distri- 
bution from which x and y are drawn. This fact will be fairly obvious from the 
present paper whose main purpose is to give clear geometrical descriptions of 
the distributions of ratios, t = y/x, for each of several cases in which {x, y) 
are points taken at random from certain simple geometrical figures conveniently 
located with respect to the origin. 

In accord with the suggestions to be cautious when the divisor is near zero 
or negative, we consider first the very simple case of ratios i = yjx obtained 

‘E. C. Fieller, The distribution of the index in a normal bivariate population, Bio- 
metrika, Vol. 24 (1932), pp. 428-440. 

“ Solomon Kullback, Annals of Mathematical Statistics, Vol. VII (1936), pp. 51-63. 
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from points uniformly distributed over a rectangle such as is shown in Fig. 1 
with sides parallel to coordinate axes and ai > 0, &i > 0. As indicated oh Fig. 1, 
we assume for simplicity that the coordinates of the points are positive and 
ui S a: g 02, 6i ^ y S 

Case I. When — g — , Fig. 1. 

Oi 02 

Let k dx dy be the probability that R'point (x, y) taken at random in the 
rectangle will fall into dxdy where fc is a constant. Then 

rat 

k I dxdy = h(ch — ai){hi hi) = 1, 

Jbl Jai 

^ {<h - ai){b, - bi)- 

Transform the element k dxdy into one with variables t, and x by making 


X = X, 

y = tx. 


The Jacobian is j a: ] = x. 

The new element is fc x dxdt and is to be ihtegrated over the range on x for 
an assigned t in order to get the probability, to within infinitesimals of higher 
order, that a random i falls into an assigned dt. By assigning t any value such 

that — ^ i ^ , say t is the slope of MN, (Fig. 1), we have 

02 Ol 

(1) fc xdxdt = dt 


the limits of integration being indicated by the ends of the line MN. 

h 

dx Og 

M'N' , we have 


When the assigned i is such that — ^ i ^ , say t is the slope of the line 


( 2 ) 


- l : 


^ r 2 


k I xdzdt = ~{al ~ al) dt 


WTien the assigned t is such that — ^ t ^ say it is the slope of M"N" , 

02 Ol 

we have 


kj X dx dt = ^(^ — a\j dt 


( 3 ) 
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Thus, 

oftisgi^enby 

(4) 

(5) 

( 6 ) 

See rig- 2 


(ll (2)> %• 1) - S -, the tiequeacy function 

^ fli flj 


(oj - when. - ^ t g -, 

\ iV ds tti 

F(t) = 5(4 - Oi) when -gig-, 


F (0 = I 


hfbl 


Fit) = 0 llT ~ “i ] when - g t ^ ^ 


ai 


bi < b 


{or the general form of the frequency curve F(t) when ~ 
with the segment from t = a horimtal straight hne and with disco 


tinuities in 


the 


ftv 


first derivatives of F{t) at t = and t = -• 

Os 



y?hen ai ^ ^’^^'I'^ency curve approaches 


Fit) - ~ when 0 g i g 


(7) 

( 8 ) 


b 

Os 


when t^b. 

,»J Os 

. j 1 . — 0 extends 

■,. ^„v be noted that the curve given by making Oi = 0 and oi_ - 
JLSy , and that the first and second moments about the ongm 

infinite. 
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Case II. When 

Oi aa 


If the rectangle in Fig. 1 were moved upward keeping its sides parallel to the 
X and y axes until - > we would obtain 

fli fla 


(9) 

( 10 ) 
( 11 ) 


F{t) 




m 


if 


Pit) hi) 




- S i ^ 

fla Oa 


fla 


ai 


if 


- ^ t <-. 


ai 


ai 


By comparing (5) and (10), it may be observed that Fit) of the middle seg- 
ment of the distribution curve differs much in Case II from its corresponding 
constant value in Case I. 

By moving the rectangle of Fig. 1 downward, keeping its sides parallel to the 
X and y axes until hi is negative, we easily find further forms of the distribution 
curve P(t). 

To consider the distribution of the ratio t ~ y/x for another very simple type 
of distribution of x and y, suppose we have given the distribution function 


( 12 ) 

where 



fix,y)<=ke"^ ^ 
fix, y) dxdy = 1. 


/x ^ c > 0, 2/ non-negative 
' \a > c, 6 > 0 

Then 


) 


k = 


ge/o 

ab' 


In this case, 


(13) 




X Xt^ 

X e “ ’’ dx 


1 

b al 



ah \ 
h + at) 


C { 



a monotone decreasing function from t = 0 to i = <» . 
With c = 0 as a limiting value, we obtain 


(14) 


Fit) = 


ab 

ib 4- a<)®' 


a distribution curve with the mean value of t at infinity. 
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If we sliould similarly consider 

_ _ ji2_ 

(15) f(x, y) = — ^ e (a: and y non-negative) 

'IrtTfO'y 

we easily obtain 



as the distribution function. 

Although the difidculties® of the problem of the distribution of the ratio y/x 
when X and y are normally correlated have been overcome'^ to a considerable 



Fig. 3 


extent, still the examination of some very simple geometric cases of non- 
normal but linear correlation may not be without some interest. Such a case 
will now be considered. 

For one very simple case in which x and y are correlated, suppose we are 
given a set of points {x, y) uniformly distributed over the parallelogram ABCD 
(Fig. 3) with sides AD and BC parallel to the 2/-axia so that the regression of 
y on a: is linear as shown by the line RS. 

The equation of RS is 

(17) y — mix — ai) + \ 

• Log. oit,, Pearson, p. 631. 

' Loo. oit., C. t). Craig, B. C. Geary,' E. C. Pieller. 
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Then although X{ and y, are correlated, *,• and 
y'i = Vi ~ - 0 .) - 

/ 

are uncorrelated. Let us consider the distribution of the ratio t' = — . 

Consider the element of frequency kdxdy', where 
(18) kibi — bi) (oi - Oi) = 1 . 

Change variables to x and t' by the transformation 


X = X, 

y' = t'x. 

Then the element of frequency becomes 

(19) kx dx dt'. 

Next integrate (19) with respect to x under the restriction that i' is assigned. 
Three cases occur : 


s (' s 


(a) Wl„a 2 „. - - - 2a, 

element of relative frequency of t' in di', 


, we obtain by integration of (19) for the 


( 20 ) 

(b) When t' ^ 


k 


I X dx dt' = 

J ax 


/ 2 


{a\ — a\) dt'. 


- 62 — bi 


20 ? 


, we obtain 


6 j— ’ 


( 21 ) k£"‘' xdxdt' = I - a! J dt' 

(c) Wheni' g — we similarly obtain 


( 22 ) 


k 


^ 0,2 

bt—h i 

r it' 
Jai 


z dx dt' = - ^ - ai J dt 


From (18), (19), (20), (21) and (22), the frequency function of t' is given by 

1)2 — hi. 

20 * ' 


(23) 

Fit') = 

2(62 - hi) 

bi — hi ^ 
202 “ 

t' g 

(24) 

Fit') = 

1 f 

■(h 2 - biY 

- o! 

2(62 - hi) (ct 2 - «l) L 
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where the range of i' is subject to either the inequalities, 

^ or - ^ 

2a2 2ai 2ai 202 

See Fig. i for the general form of the Fit') frequency curve. 

If we make oi = 0, the curve becomes infinite in range. If we make not only 
oj = 0, but ih + h)/2 = 0, we have, in place of (17), 


y = mx. 


In this limiting situation, if we make Os — a and 



= i, 



(23) becomes 

(25) Fit') — for — - ^ and (24) becomes 

40 a a 

(26) Fit') = — ^ for t' and for i' g — 

4 at^ a a 

Then we have y' = y — mx 
and i' = - = i — m. 

X 

Further, if t' is distributed in accord with a frequency function, Fit'), the 
distribution of i = <' -f m with m constant is given by 


Fit — m). 
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Hence, the probability that a random value t will fall into a range t to t + dt 
is given to within infinitesimals of higher order by 


(27) 

a* 

and by 


(28) 

b dt 

4o(t — my 


when m + 

a a 


when 


t ^ tn + - and t ^ m — 
a 


b 

a' 


With the frequency curve given by (27) and (28) we may note that the variance 
of t becomes infinite. 

Without taking the space to continue illustrations, it is fairly obvious that a 
wide diversity of form can be given to the frequency function of the quotients 
( = y/x by relatively simple changes in the location of a sample parent popu- 
lation with reference to the origin. 
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THE EUNDAMEHTAL NATURE AND PROOF OF SHEPPARD’S 

ADJUSTMENTS 

In the course of our discussion of moment adjustments, we shall have occasion 
to refer to the following lengthy distribution of discrete variates. By selecting 

TABLE 1 


Distribution of the number of items correctly recorded by 244 students in a five 
minute code transcription test* 


Score 

X 

Preq. 

Score 

X 

Freq. 

f 

Score 

X 

Preq. 

f 

64 

1 

94 

3 

119 

1 

66 

2 

96 

5 


2 

68 

2 

96 

3 


6 

69 

1 

97 

3 


2 

70 

1 

98 

12 

123 

3 

71 

3 

99 

4 

124 

2 

72 

3 

100 

5 


6 

73 

3 

101 

6 

126 

3 

76 

1 

102 

8 


4 

77 

2 

103 

6 

128 

2 

78 

3 

104 

8 


2 

79 

1 

105 

9 

131 

1 

80 

2 

106 

5 


5 

82 

2 

107 

3 

HH 

1 

83 

3 

108 

3 

■H 

1 

84 

2 

109 

4 


1 

85 

6 

no 

2 


1 

86 

» 3 

111 

4 

■iH 

1 

87 

1 

112 

7 

141 

1 

88 

2 

113 

5 

142 - 

2 

89 

4 

114 

5 

144 

2 

90 

4 

115 

7 

153 

1 

91 

5 

116 

8 

155 

1 

92 

2 

117 

3 



93 

4 

118 

* 2 

Total 

244 


* I am indebted to Professor J. A, Gengerelli, of the Department of Psychology of 
Univ. of California at Los Angeles, for these data. 
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the provisional mean, Mo = 105, we find that 

2x f = -129 Sa;’/ = -52 005 

S = 77 591 S x^f = 69 239 951. 

Let us now form the nine possible distributions of grouped-discrete variates 
that arise from the nine possible “groupings of nine.” These are presented 
in table 2. 

TABLE 2 

Distributions derived from the data of table 1 by making the nine possible 

“Sroupings of nine” 


First significant class interval of distribution 


(1) 

64-72 

(2) 

63-71 

(3) 

62-70 

(4) 

61-69 

(5) 

60-68 

(6) 

59-67 

(7) 

58-66 

(8) 

57-65 

(9) 

66-64 

13 

10 

7 

6 

5 ] 

3 

3 

1 

1 

12 

15 

16 

16 

14 j 

14 

13 

15 

1 IS 

27 

23 

21 


22 j 

21 

16 

14 

11 

41 

41 

; 33 

32 

30 

28 

31 

29 

30 

53 

54 

63 


55 1 

52 

49 

45 

41 

45 

45 


38 

42 ! 

45 

44 

48 

52 

27 

27 

29 

34 

36 

39 

40 

42 

43 

16 

19 

24 

25 

23 

24 

28 

30 

29 

8 

6 

7 

6 

10 


12 

11 

13 

1 

2 

2 

4 

5 

6 1 

6 

7 

7 

1 

2 

2 

2 

2 

2 

2 

2 

1 



1 

! 

1 





1 


Let US now compute the values of Sx/, Xxf and for each of the 
distributions of table 2, selecting Mo = 105 in each instance in order to facibtate 
a comparison of these results with those for table 1. Thus, in spite of what 
would otherwise be called poor computing technique, we shall use the following 
class marks as values of x for the first distribution above; —37, —28, —19, ■ • • , 
35, 44, 53. For the second we shall likewise use, —38, — 29, — 20, ■ • ■ , 34, 43, 
52, respectively. 

TABLE 3 


Summations derived from the distributions listed in table 2, using Mo = 105 


Dist 

Xx/ 

xxy 

Si’/ 

Xx*f 

(1) ' 

- 181 

77 149 

- 134 191 

69 063 265 

(2) 

— 218 

78 466 

- 54 602 

74 519 962 

(3) 

- Ill 

77 769 

2 889 ! 

71 465 409 
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TABLE 3 — Continued 


Dist. 

Xxf 

Sx*/ 

Sx*/ 

SxY 

(4) 

- 139 

79 747 

- 23 311 

74 171 443 

(5) 

- 104 

81 934 

19 666 

76 143 874 

(6) 

- 87 

80 145 

16 551 

72 467 541 

(7) 

- 52 

80 302 

- 36 118 

71 851 930 

(8) 

- 89 

78 553 

- 101 357 

68 426 497 

(9) 


78 894 

- 180 792 

73 155 160 

Average 

- 129 

79 2171 




The fact that the average of the values of Sa:/ appearing in table 3 suggests 
that no adjustments of the first moment is necessary and that the variations 
in the nine values for Sa:/ may he regarded as accidental errors and attributed to 
grouping. An attempt to account for this phenomenon and also for the fact 
that the averages of the higher order summations of table 3 do not likewise agree 
with the corresponding summations of table 1 lead us directly to formulae for 
Sheppard’s adjustments. 

For the moment, let us concentrate our intention upon a single variate, xo, 
and its associated frequency, that are a part of a distribution of discrete 
variates, such as table 1. Suppose we were to form the fc different distributions 
arising from the h possible “groupings of h." In one of these distributions, 
Xo will rest in the first position of a class interval : the limits of this class are xj 
and (xo + A; — 1) and the class mark is therefore [xg + — 1)]. The 

contribution of the variate, xo, to Sx*/ for this particular distribution is therefore 

[% + §(A: - 

If Xo rests in the second position of a class, the limits of this class will be 
(xo — 1) and {xo + k — 2) and the corresponding class mark is [xo + — 3)] 

and the contribution of xg to Sx*/ for this distribution is 

[xo + Kfc - 3)]*-/^. 

The expected value of Sx* / arising from the k different groupings of variates is 
therefore, 

X 

where refers to that distribution ip which a specified xo rests in the f-th 

position in the class in which it occurs. The contribution of Xo to this expected 
value is therefore 
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(2) ^ {[ 2:0 + Hit - 1)]' + [%+!(*- 3)]' + [X 6 + ^ (^! - 5)]* + • • . j , 

this series consisting obviously of k terms. 

Expanding each term of (2) by the binomial theorem yields 


*0 

- .Ci2:r* 1 

- 1^ 
< 2 y 

1 + .Cj xr* i 

fk- 1 ' 
2 

y-.CaxrN 

fk~i' 
[ 2 

)■-] 

xl 

-jCixr^ 1 

V 2 , 

| + .Csxr* 

fk~5 
{ 2 

)'-.c',xr® 

fk~B' 
k 2 

)*-] 

ajfl 

- .Cl xr^ 1 

^k — 5 '' 

V 2 y 

l + Axr* 

ffc - 5 
^ 2 

)'-.c,xr 

fk - 5 
V 2 

)•-] 


etc. 

Since s is an integer, series (2) may be written as the sum of the (s + 1) terms 
of the series 


(3) [x5 s, - .Cl xr^ Si + .C 2 xr" & - .c* xr^ Si + 

where 

®‘ ■ j [(^ 7 + + (^ 7 + • ■ ■ * ‘'H • 

By the Euler-Maclaurin Sum Formula we have 
x>> - — ^ - a^’+i) + ^ P 

p + 1 2 ! 

_ ^ pW (&r-« _ ^ p<« (bp~^ - a*-') + • • • , 

4 ! D 1 


where pW = p(p — 1) (p — 2) (p — 3) • • • to i factors. In our expression for 
S,, a = 5 (fc — 1 ) = —b, and therefore Si equals zero when i is an odd integer. 
For even values of i, 


( 4 ) 



J(k~iy(k + i) . Bi,/fc- iV • 

\ 2>+i(l + f) ■^2! \ 2 / 
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SO that 

So = 1 

s, = l (k^ - 1) 

= (3fc^-7) 

1^ ~ 
etc. 

Since expression (3) represents the contribution of any variate, a^, to the 
expected value defined by (1), we may obtain by summation 

(5) E(Z *•/) = E *•/ + A • So • E + .Ci • S, E + • • • . 

To illustrate: if we desire to shorten the distribution of table 1 by fonning class 
intervals of dimension 9, 

Si = - 1) = ^, ^ (9“ - 1) (3 . 9^ - 7) = 

and by formula (5), 

®(E a: /) = E - 129 

on 

E(Z a:V) = E + A • & • E/ = 77591 + ^ • 244 = 7921^/’ 

on 

■B(E a;’/) = Ea;’/ + sCs . & • E ^/= - 52005 + 3 . y (- 129) = - 54585 

E(E^*f) = Ea:V + A.S2-E^*/+ A'S4-E/, 

= 69239951 + 6 ~ • 77691 + ^ • 244 = 723627852/® . 

Since these expected values are identical with those computed directly in table 3, 
we see that formula (5) provides the adjustments necessary to eliminate the 
effect of the systematic errors caused by grouping. 

Dividing both sides of (5) by S/ yields 

(6) E{n,) = /i» + iCj • Si • M>-2 "1“ ' Silig-i -)- jCe • St • Mj— 6 4" • • ■ I 

Eini) = n'l 

Eini) = ^ (fc“ - 1) 


that is 
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E(h) = M8 + ^ (fc* - 1) /i 

Ei/,) = Mi + ^ (fc* - 1) ^ - 1) (3fc“ - 7) 

Ein',) = + “ (fc* - 1) Mi + ^ (ft* - 1) (3fc* - 7) mI 

etc. 

In numerical computations we generally prefer to select the class interval 
as the unit of x and in this case we have 

E(/x) = ix[ 

Ei/,) = 

E(/,) = Mi + j|(l - i) 

E(m J “ Ml + j 2 (l - ^) Ms + ^ (l - p) (3 - 

etc. ^ 

Ordinarily We are interested in estimating the values of the moments that 
would have been obtained if we had not used the time-saving device of grouping 
the variates and therefore we solve the previous set of equations for the moments 
of the ungrouped distribution and obtain 

ni ~ E{n[) 

- EC,;) - i (1 -i) «,;) + i (1 - (7 - 1 ) 

etc. 

In general we may write, corresponding to formula (6), 

(8) n'. = Ein'.) - .Ci . P, . -1- .Cl • Pi • E{ii',-x) 

where 
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In actual problems we do not know the exact values of the expectations 
involved in formulae (7) and (8), and are forced to obtain mere approximations 
by utilizing in their stead the corresponding moments computed from the 
single chance grouped distribution. These approximations correspond to those 
employed in the theory of probable error, namely, substitutions of the moments 
derived from a single sample for the corresponding expected moments of the 
parent population. 

The adjustments so far considered may properly be referred to, as Sheppard’s 
adjustments about a fixed point. At first thought it might appear that we might 
obtain corresponding formulae for the expectations of moments about the mean 
by merely dropping the primes in formula (6) and obtain, for example. 


M2 = 


but unfortunately this is not true. For example, the exact value for the variance 
of the distribution of table 1 is 18915563/24:4’*. Using the summations of 
table 3 and computing the variance for each of the nine groupings yields 


E{pt) = 


( 9 ) 


9.244“ 


[18791595 + 19098180 + 18963316 + 19438947 


+ 19981080 4- 19547811 + 19590984 + 19159011 + 19217736] 
= 19309851/244“. 


Since — 
1^2 


1) = -^ (9“ — 1) = 20/3 we see that 


Ui < Eiui) — ^ — 1 ) . 

In the theory of sampling we differentiate between the standard errors of 
moments about a fixed point and the standard error of moments about the mean 
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of the sample. Apparently writers on the subject of Sheppard’s adjustments 
have overlooked the case of adjustments about the mean, although the solution 
for the second moment is readily obtained as follows: 

EW = - M^) - E{n[) Em 

“ Mii + 1) — ^ (-^1 + M\ + • * ■ + M\) , 

where Mi represents the mean of the f-th of the h different grouped distributions. 
Since 


/ij = jts - = /is - i {Ml 4- Ms + • . + Mi) , 

E{li^ = ^ — 1) 

{M\^M\+ ... -{-Ml /Mi + M,+ ...-f MiV 
^ \ i /. 

But since for any set of k variates 



we have that 

( 10 ) 1 ) - 4 - 

Referring back to table 3 we find that 

2 _ 7856 
3.(244=*) 

and the numerical results now satisfy equation (10) . 

For the benefit of those interested in unsolved problems of mathematical 
statistics we may say that' nothing appears to have been written as yet on the 
most important problem associated with the systematic errors due to grouping. 
It is of course desirable to eliminate these systematic errors introduced by 
grouping, but it is even more important to investigate the distribution of the 
accidental errors that remain after the systematic errors have been eliminated. 
For example it is gratifying to know that no systematic errors are present in the 
Sk/ column of table 3 and that equation (6) will enable us to add a constant to 
each summation of the S*’/ column so that the mean of these adjusted values 
will agree with the value Sa;®/ = —52005 obtained in table 1. It is rather dis- 
concerting, however, to realize that in actual practice we tnay in the case of 
discrete variates and must in the case of continuous variates select an arbitrary 
set of class limits for our recorded data, and that after adjustments for grouping 
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have been made, onr estimates of the true values of the moments of the distri- 
bution will— as in table 3— depend so much upon the choice of these limits. 
Thus, the standard error of the mean attributed to grouping is 



which is about twenty percent as large as the approximation for the standard 
error of the mean due to sampling from an infinite parent population, namely, 

If one will take the trouble to compute the values of m and fn for each of the 
distributions of table 2, utilizing the summations of table 3, and then compute 
and compare the values of <r,,, and due to grouping with the corresponding 
functions associated with sampling, he will realize the seriousness of the situation. 

SuMMAHT 

The formula for Sheppard’s adjustments for distributions of grouped discrete 
variates was first given without proof in the Editorial of Vol. 1, No. 1 of the 
Annals (page 111). The method used to develop the general formula was 
extremely laborious and paralleled the method used for the case of continuous 
variates in the Handbook of Mathematical Statistics, Chapter 7, except that the 
calculus of finite differences was employed. A more satisfactory proof of this 
formula was presented by Dr. J. R. Abemethy in Vol. 4, No. 4 of the Annals 
in an article entitled “On the EliminaMon of Systematic Errors Due to Grouping." 
An extremely elegant development of the same formula and an extension to the 
case of two variables appears elsewhere in this volume by Professor C. C. Craig. 
From the point of view of expectations, all of these developments are adjust- 
ments about a fixed point, although this fixed point may be selected arbitrarily 
at the mean of the distribution in question. The obtaining of formulae for the 
adjustments about the mean of each grouping and the distribution of the 
accidental errors that remain after these systematic errors have been removed 
has apparently been neglected to date and should interest students of mathe- 
matical statistics. 

From a mathematical standpoint, the development of this paper is the 
simplest of all that have appeared to date: the adjustments for the first four 
moments can be worked out with the aid of the binomial considerations leading 
to formula (3) and the following well known formulae for the sums of the powers 
of the first n integers: 

j, _ n(n -b 1) c. + 1)* 

2 *3 1 

p _ n(w -t- l)(2n -(-1) ^ n{n -f l)(2n -f- l)(3n^ -1- 3n — 1) 

S, S, = 3 ^ 
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One should note that the condition of high contact is not required in this 
paper or in the developments of Abemethy or Craig. The results of the three 
preceding papers agree with those obtained about a fixed point in this paper, 
but fail to hold for the case of expectations about the mean, if we accept the 
following definition; 

^ (m».i + ‘ , (s = 2, 3, • ■ • ) 

where j«,,, designates the s-th moment computed about the mean of the f-th 
grouped distribution, (1 ^ f g fc). 


H. C. Carver. 
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ON A GENERAL SOLUTION FOR THE PARAMETERS OF ANY 
FUNCTION WITH APPLICATION TO THE THEORY OF 
ORGANIC GROWTH 

By Haery Sylvester Will 

Part I 

I The Problem Stated. A type of problem which continually arises in the 
ordmary course of statistical analysis is that of determining the numerical values 
of the parameters of a function used to represent a series of observational data. 
In mathematical terminology, the problem may be stated as follows: 

Given, the observational series Fo, Fi, • • • F„_i. 

Assumed, the function y = f{x, a,b,c, • • • ). 

To find, the numerical values of the parameters a, 6, c, • • • . 

If the function/(a:, a,h,c, ■■ ) is linear in the parameters, the desired solution 
is easily obtained by familiar methods. In cases where the function is not 
linear, the standard procedure is to reduce it' to the linear form by expansion 
into Taylor’s series, thus: 

/(x, a, h, c) = f{x, floboCo) + fa{x, a^(fin)'Aa + /),(x, aoboCo)-Ab 

+ fc(x, aoboCdj-Ac, 

where a = aa + Aa, b = 60 + At, c = Co + Ac. 

The use of this method suffers from the excessive labor involved as the number 
of parameters to be determined increases. In cases where satisfactory values 
of the first approximations ooboCo are not obtainable*, the solution becomes im- 
possible. The basic difficulty arises from the consideration that the Taylor 
theorem requires that the increments Aa, Ab, Ac shall be very small quantities. 

A method of successive approximation which makes feasible the reduction of 
gross errors in the corrections will, I take it, be of considerable interest to 
mathematical statisticians. Let us, therefore, proceed to the development 
of a technique which accomplishes precisely this result. 

II. The Theta Technique. Let us begin our development with the follow- 
ing restatement of the technical problem involved: 

Given, the observational series Fo, Fi, • • • F„_i. 

Assumed, the function y = f{x, (ao + 0 iAa), (bo + d^Ah), (co + ^aAc)). 

To find, the values of ^i, 6i, 83. 

in this set of relations, ao, bo, Co and Aa, Ab, Ac are known quantities; while 
$1, 82 and 8} are each assumed not to exceed ±1 in value.* It follows, therefore, 

165 
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that the adjusted values of a, b, and c lie within the bounds ao ± Ao, bo ± Ab, 
Co ± Ac. We may, then, write the following: 

tti = ho — Aa; a2 = Oo + Aa. 

h = bo — Ab; ba = bo + Ab. (2) 

Cl = Co — Ac; Ca = Co + Ac. 

The values of 61, 62 and 63 are determined by the following procedure: 

First, form the function y from all possible combinations of OiOa, biba, cica, 
thus: 

yiii = /(«, aibjCi). 

ym = Six, aibiCa). ■ ■ . . 


2/222 = f(x, OabaCa). 

In the case of p parameters, we can evidently form 2 ” distinct sets of n values 
for the function yu, . Since the assigned values of parameters are mere approxi- 
mations to their true values, each computed set of values for the function y,u 
will differ from the true values y = fix, obc). 

Second, form the theoretical residuals y,,, — V, and then compute the corre- 
sponding standard errors of estimate o-,,,. There will, accordingly, be 2 ” values 
of 0- determined, each value being a measure of the error committed in assuming 
the corresponding approximations to parameters; thus, o-m measures the errors 
committed in assummg the combination cibiCi ; a-m measures the errors com- 
mitted in assuming aibiCa ; • • • ; <r222 measures the errors committed in assuming 
<l2b2C2 ■ 

Third, taking the squared reciprocal of <r as a measure of the reliability of a 
given determination of y,,i from the parameters a,b,c,, we may form the follow- 
ing comparative tests of the reliability of the 2 »’ sets of the values of y;,, , thus; 

, —2 •/ —2 _| — 2 I I — 2 \ —2 —2 

’T <^112 \ "T <^2 2 2/ “ ^vii ' 

-2 . V ' -2 

— <yii2* 2^ * /.N 


2 t “"2 

0)222 = O' 2 2 2 • 2 Lj . 

Omega, we shall term the test constant. Obviously, = 1 . 

Fourth, assuming three parameters, let us tabulate the possible subscripts of 
omega according to the following scheme : 


“(ad 

< 11 ( 02 } 

o)(bi) 

w(i)s) 

w(ci) 

£i)(cs) 

111 

211 

111 

121 

111 

112 

121 

221 

211 

221 

211 

212 

112 

212 

112 

122 

121 

122 

122 

222 

212 

222 

221 

222 
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In this table, the subscripts are in the order of abc; so that 111 denotes 
b}{aibiCi); 112 denotes w(ai6iC2); etc. Comparing columns w(ai) and w(as), -we 
observe tiiat the he subscripts are identical for both; while the ai subscripts of 
the first column are replaced by the <12 subscripts in the second column. Again, 
comparing columns and ci>(?>2), 'we see that the gc subscripts are identical 
for both; while the bi subscripts of the one column are replaced by the 1)2 sub- 
scripts in the other. Finally, comparing columns w(ci) and w(c2), we note that 
the ah scripts are identical for both; while the Ci subscripts of the one column 
are replaced by C2 subscripts in the other. 1 

Fifth, let us form the column summations S«(ai), Sw(a2) ; Sctf(bi), '2u(bi) ; and 
2 ij(ci), So) (02) . Since the columns w(ai) and £0(02) differ only with respect to the 
a subscripts, the difference in value between the sums 2 co(oi) and Sa)(ai) can be 
due to differences in value between ai and Oi only, and are not at all affected 
by differences in value between hih and cjcj. Sw(oj) and 2 to(G 2 ) may, therefore, 
be regarded as the weights of ai and at to be used in determinmg the adjusted 
value of a; for S<i)(Gi) -f Sw(aa) = 1 . 

We may, then, write the following relations: 

o = Sw(ai)'ai -b 5w(o2)-02 = S«(oi)-(oo — Aa) + 7^u{ai)‘{ao -f- Aa) 

(®) 

= (S<o(ai) -f* Sco(fl2))‘ffio "t* (2 w(g 2) — Sw(oi))‘Ao = Oo "b (?(tt)‘Aci. 

Since precisely similar reasoning applies to the parameters bi, ba and Ci, cs, 
we have the following definitive formulas for computmg the values of theta; 

6(a) — Sco(a2) — Sa)(oi). 

d(h) = S«(h2) - S£o(bi). (6) 

d(c) = Sa)(C2) — Sw(cj). 

As the adjusted values of parameters, we have: 

a == Go + 6(a) • Aa. 

6 = 5o -b 9(b)-A6. (7) 

c = Co -b 6(c)- Ac. ■ 

In this development of the theta technique, we have determined (r,i, from the 
theoretical residuals y,ii — y. This has served well the purposes of exposition; 
but, since the true values of the function y are unknown, we must, in practice, 
compute an, from the observational residuals yni — Y. Later in the memoir, 
it will be shown how the computation of 6 may, in numerous cases, be con- 
siderably abridged. 


Partn 

III. The Principle of Malthus. Since a determination of the numerical 
parameters of a given function by means of the theta technique must, at best. 
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involve a considerable amount of computation, I have chosen for purposes of 
demonstration a problem which is of much interest in itself. This problem, we 
shall state in the form of two questions: 

First, what is the most appropriate mathematical form of the law of organic 


growth? 

Second, how may the parameters of the indicated function be computed? 
Thomas E. Malthus, in his famous essay on The Principle of Population Growth 
assumed that the proportional growth of human populations is properly defined 
by the differential equatiori, 


1 

V 



( 8 ) 


where p is the population under consideration, t is the measure of time, and h is 
the stable or geometric rate of growth. 

This formula has been destructively criticised on the ground that it fails 
wholly to give a mathematical description of the manner in which population 
growth is kept within bounds. So far as any implication of the formula is 
concerned, populations may grow to infinite magnitudes. An attempt to 
represent growth by its use must, therefore, result in a succession of discontinui- 
ties which are incompatible with the observed facts of organic growth. 


IV. The Symmetric Logistic. In three memoirs published in 1838, 1845 
and 1847, it was suggested by M. Verhulst, Professor of Mathematics in the 
Ecole Militaire m Brussels, that the rate of population growth might be stated 
as a function of the population itself. Assuming the limiting value of p to be 
Hf this conception of the growth rate Verhulst expressed by the differential 
equation, 

i . ^ _ 6(1 _ pi7-i). (9) 

"P dt 

Since this equation expresses proportional growth as a linear function of p, 
it is the simplest relation of its kind that may be conceived. In representing the 
rate of growth as a quantity which approaches zero as the population approaches 
its limiting value, it makes, indeed, a significant advance over the Malthusian 
formula. Nevertheless, the equation is subject to an interesting limitation, 
the nature of which is made evident by an examination of the integral form of the 
function, namely: 

p = H:[l -f e“W]. (10) 

This we shall now prove to be rotationally symmetric with respect to the point 
of inflection. 

Differentiatmg equation (9) a second time, we have, 

(Pp = — 6 dp[p{\ — II~^p)]dt 

= p[p-Hp"^ - b di'i + bE-ip dH -f- hH-Hp dt] 

= bH~^p dp dt. 
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Hence, 

^ = b^p(l - H-^vY ~ VH-YH - H-^v)- 


Setting 


d^p 

dt^ 


0, we get, 


Or 


1 - 2H~^p = 0. 


P == H/2, (11) 

which gives the value of p at the point of inflection. 

Substituting for p from (10), and solving for t, we have, 

t, = -a/b, ' (12) 

where is the point of inflection of the function p. 

Denoting the magnitude of the population at time i, by p,, its magnitude at 
time t,+k by p,+k, and its magnitude at the time L,-k by p,-i, we have, 

p, = H:[l -f- e“+i>c-a/w] = }jj2. (13) 

p.+fc = H:[l + = H:[l + (14) 

Pi.k = H;[H- = H:[l + (15) 


Measuring p in units of H and setting u = we may rewrite these last 
three equations as follows: 

H~'p, = 1/2. 

H~'-p,+i. = 1:[1 -f w]- 
H-^pi-i = 1:[1 + u-^]. 

On the hypothesis of rotational symmetry, we have, by subtraction, 

H-^p^^j. ~ 1/2 = 1/2 - 


In proof, we have : 

1;[1 + w] = 1 - l:[l + u-i] 
=: 1i~U[l + 


= l:[u 4- 1]. 

q. e. d. 


Part m 

V. Criticisms of the Logistic. Because of its symmetric form, many critics 
have called into question the finality of the logistic as a universal repre- 
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sentation of population growth. That it applies in particular cases, they con- 
tend, is no reason for holding that it must apply m general. Professors Raymond 
Pearl and Lowell J. Reed of Johns Hopkins University — to whom we are in- 
debted for the rediscovery of the earlier researches of Verhulst — ^ha-''e proposed, 
as the proper form of the generalized growth curve, the following function: 

p = R:[l + (16) 

In their view, this equation is suited not only to representing a smgle cycle of 
growth, but two successive cycles as well. This claim, however, must be 
rejected; for, if true, it would mean that one cycle of growth is predictable from 
another, a circumstance which is clearly inconsistent with the assumptions laid 
down by these same investigators. 

Moreover, so far as I can learn from their published writings, these authors 
have never considered the implications of the differential form of the function 
they propose. 

Differentiating (16), we have, 

- ■ ^ — {b 2cx + 3dx^) (1 — 

V at 

Here, we find the stable growth constant of Malthus replaced by an expression 
which is quadratic in t. This means that, for a population which is freed of a 
restraining limit, proportional growth tends generally toward infinite values. 
If there are any facts to support such a conception of organic growth, I do not 
know what they are, and must, perforce, reject the contention that equation 
(16) is the generalized form of the "Verhulst function. 

VI. Fundamental Assumptions. In order to represent the phenomenon 
of population growth mathematically, I hold the following assumptions to be 
necessary: 

(a) Under favoring conditions, population may increase at a constant geo- 
metric rate. 

(b) Under all circumstances, the rate of growth must be a finite and continuous 
quantity. 

(c) The magnitude of a population is always a positive, real number. 

(d) The growth of population tends toward restriction within definite bounds. 

(e) The growth of population is a function of time. 

(f) The basic conditions of growth are free of cataclysmic disturbances. 

The first of these assumptions is given in recognition of well known facts 

concerning organic growth. The second is necessary because, even when the 
size of a population is freed of definite restriction, the pattern of growth is not 
necessarily geometric. The third assumption affirms the absurdity of represent- 
ing a population as a negative or infinite quantity. The fourth merely asserts 
the indisputable f ^ct that the organism must always grow in a finite environment. 
The fifth gives place to the concept of growth as the resultant of a complex of 
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causes, no one of which can be isolated as an entirely independent variable. 
'While the final assumption recognizes that major disturbing influences may 
profoundly affect xhe course of growth. 

VII. The Skew Logistic. In accord with our fundamental assumptions, we 
may form the following differential equations; 

1 dr>' 

— • = — [h + sm • cos (m(< + ?))] [1 — H~'^p'] Type a 

P Oil 

= - [6 + 2&mKt + g) : (1 + mH( + g)»)] [1 - Type ^ (17) 

= —[6 4- sm?{t + g) : a/I + w?{i -|- qy] [1 — E^^p'] Type y 

In these equations, p' = p — L, and measures p from its lower limit as origin. 
On separating variables, the following integrations may be performed; 

- j [dp':(p'(l - H~Y))] = - log Ip': (1 - H-V')] = log[(^ - P'y.iHp')]. 

Writing z = m{t -j- q),dz = mdt\ ao that we have: 

bjdt a J cos z dz = A bt s • sin z. 

b j di + 2s j [z;(l + z'‘)]dz = 4. + ?>< + 8 • log (1 + g*). 

h j dt + s j [z:\/l 4 - z*]dz = A -{■ bt + s \/l A- 

From these integrals, we form the following equations : 

log [(1? ~ p') : (.Hp')\ = .4 4- hi 4- s • sin lm{t + g)]. 

logoff — p')'(Hp')] = 4 4- hi 4- s - log [1 4- mKt 4^ ?)*]• 

log [(H - p') : (Hp')] = A + bi + s- Vl 4- + g)'- 

We have, finally, on taking antilogarithms and making the substitutions 
p — p' + L, a = A — log E: 

p = L -j- E :[! + Type a 

p = L + E:[l + e«w+.-iog(i+mK*+*i«)]. Type d (18) 

p = L -h E :[! + fi»+’>'+«Vi+»n»((+a)>]. Typey 

These equatiotis give the normal forms of the skew logistic. 

VIII. Properties of the Skew Logistic. We may deduce the properties of 
the skew logistic by examining both its differential and integral forms. Con- 
sidering the derivative of Type a, we note that the Malthusian constant h is 



172 


HABRY SYLVESTER WILL 


replaced by a trigonometric function whose amplitude is 6 ± sm, and whose 
phase depends on the values of m and g. When b ± sm = 0, the derivative 
must also equal zero, and a flat point in the curve of p is indicated. When b is 
absolutely less than sm, the derivative changes sign and the curve of p reverses 
its direction. Thus, the integral form of Type a modifies the symmetric form 
of the logistic by a succession of minor cycles in which the rate of growth is 
alternately accelerated and retarded. 

Considering Type we find the Malthusian constant replaced by a function 
whose maximum and minimum values are attained when t = m~i — q. Obvi- 
ously, therefore, this function passes through a single period whose amplitude 
is b ± sm, and whose phases are b, b sm, h,b — sm, b. When b ± sm = 0, 
a flat point m the curve of p is generated. The effect of skewness on the rate of 
growth passes through two double phases. Where b and s are of the same sign, 
these phases arc : first, increasing retardation followed by decreasing retardation 
when f -b g is negative; and, second, increasing acceleration followed by decreas- 
ing acceleration when f •+• g is positive. Where b and s are of opposite sign, 
the corresponding phases are : first, increasing acceleration followed by decreas- 
ing acceleration when i -f g is negative; and, second, increasing retardation 
followed by decreasing retardation when i -f g is positive. It is to be noted 
that, when sm is absolutely greater than b, the derivative will change sign twice 
before the upper limit is reached. Under these circumstances, the function p 
passes through a double reversal of direction. 

Considering Type y, we find the Malthusian constant of the derivative re- 
placed by a function which is aperiodic and which approaches the limits h ± sm 
as i approaches ± «j. When b and s are of the same sign, skewness passes through 
the two following phases : first, the phase of decreasing retardation when t -f g 
is negative, and, second, the phase of increasing acceleration when t ■+- g is 
positive. On the other hand, when b and s are of opposite .sign, the correspond- 
ing phases are : first, that of decreasing acceleration when ( -f g is negative ; and, 
second, that of increasing retardation when ( -f g is positive When sm is 
absolutely greater than b, the derivative changes sign, and the function p passes 
from a continuously increasing phase to a continuously decreasing phase, or 
mce versa. 

In general, it may be said of all three types — a, 0 and y — that, if the derivative 
is not restnfeted to a single change of sign, L denotes a lower asymptote of the 
function p; while, under the same conditions, H denotes the higher limit ap- 
proached by the function p — L. When H is negative, the effect is to make L 
an upper, and L — H a lower, asymptote of the curve p. 

In the case of Type y, when the function p makes a single change of sign, 
either BoxL becomes a maximum (or minimum) value instead of an asymptote 
of the curve. In this event, it will be noted that the factor 1 — H~^p appearing 
in the derivative does not approach zero as a limit with increasing values of t, 
but rather passes through a minimum and then approaches the liimt 1 in either 
direction. 
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The parameter s may be positive or negative in sign, and is termed the index 
of skewness or, briefly, the skewness of the function. Obviously, m is always 
positive, and, since it determines the rate at which skewness develops, is properly 
termed the development. The point in time at which skewness passes from an 
accelerating to a retarding phase, or vice versa, is fixed by the value of q, which is, 
therefore, termed the transition. The parameter b, as has already been stated, 
is termed the stable giowth tendency or, technically, the stability of the function. 
And since the position of the cuiwe p on an arbitrary time scale wdll vary with 
the value of a, this parameter I have designated the location. 

In all three types of the skew logistic, if is a continuously decreasing func- 
tion and both H and L are positive, the curve of p may be described as of the 
rising hillside form In the case of Type y, if the derivative changes from 
positive to negative sign, the curve may be described as mountain formed. If 
c'/'O) increases continuously, the curve is of the falling hillside variety, except 
when the derivative of Type y changes from negative to positive sign, in which 
event a valley form is generated. 


Part IV 

IX. Parameters of the Symmetric Logistic. The numerical parameters of the 
symmetric logistic (10) are most easily determined by the method of differences. 
First, we write, 

p;' = C -H e^+“, (19) 

where C - A = a — log H; and i = 0, 1, 2, • • n - 1. 

Assuming At constant, let us give to t the increment kAt, thus; 

PT+k = C + (20) 

Subtracting (19) from (20), we obtain 

A]^p~^ — = Be-*'^^‘, ( 21 ) 

where B = 1. The quantity A^p',^ = p~Xi, - p~^ is termed a first order 

difference of rank k. 

Giving to t m equation (21) the increment kAt, we get 

A„p-lk = ( 22 ) 

Dividing (22) by (21), we have, 

Taking logarithms, we obtain 

A^ log Aip~^ = log Aip7+i - log = hkAt, 
which defines the parameter h. We can form n — 2k such equations. Hence, 
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b is uniquely determined by the relation 

& = log A;tP7^] :[A:(n - 2k) At] 

= log log A.P;^] :[k{n + 2fc)A<], (23) 

where k = n:3 to the nearest integer. 

Eetnming to (21), we have the following relation determining the value of A ; 
4 = iogE::r-“A,pr] - log 

= log - log [b (24) 

where k = -11:2 to the nearest integer. 

From equation (19), we have 

c = E:=r' ei -n. (25) 

The values of H and a are, obviously, given by 

B = C-'. ' (26) 

a = A + log H. (27) 

In the relations defining h, A and C, the values of P must be obtained from 
the observations. In computing the values of fc, the formula is : 

k = n{r + 1)"^, 

where n is the number of observations, "and r denotes the order of reduction 
involved in the defining relation. 

In my first treatment of the subject, I assumed that the value of k for all 
orders of reductiob might be determined from the reduction of highest order 
involved ; but I have since found that I erred in this view. The point is that the 
function ^(p) = k''(n — rk), discussed in the original memqir, must be maxi- 
mized with respect to k separately for each order of difference involved; or, in 
' other words, the rank constant k must be given a separate determination for 
each parameter defined if the most accurate results are to be obtained. 

X. Parameters of the Skew Logistic, I shall now show how the method 
of differences may be used to abridge the computations involved in applying 
the theta technique to the determination of the parameters of the skew logistic. 
In this, as in the preceding section, we assume At constant. 

Operating on Type 7 of equation (18), we write 

, p, = L -f- B • [1 -t" «“+*'+* Vi+>n‘«+8J*J . (28) 

To begin with, let us write the transformation of ordinate 
G = log [ff(p - L)-i _ 1], 

Also, let us write 

P = Vl + iri^it -f q)^ . 
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We may now rewrite equation (28) m the form 

G, = a + U + aF,. (29) 

Giving to t the increment kM, we have 

G,+fc = a -{- b(< + fcAi) + sF,+i. (30) 

Subtracting (29) from (30), we have, 

AiG. = 6A:A< + sA^F,. (31) 

Again giving to t the increment kAt, we obtain 

AkOt+ib = b/cA(f -b kAf) -}- sA^Fj^i. (32) 

Subtracting (31) from (32), we obtam ' 

AtG,+i - AiG, = (bkAt - bkAt) + s(A/tF,+j, - A^F,), 
or 

AlG, = sAlF,. (33) 

We can form n — 2k such equations, and may, therefore, form n — 2k ap- 
proximations to the value of the parameter s, as follows: 

s. = [Afc(?,]:[A^F<]; t = 0, 1, • • • , n - 2fc - 1. 


Taking the mean value of the set s, as its most probable value, we have, 

St){HL’tnq) — Ss,:(n — 2fc); fc = n:3 to the nearest integer (34) 

In this determination of so , the only parameters directly involved are H, L, m 
and q, the parameters a and b having been eliminated. By assigning values to 
Ho, L^, mo and qo, we may, on setting up the arbitrary corrections AH, AL, Am 
and Aq, write down the following: 

Hx = Ho — AH', Hi = Ho "b AH', Li = Lo — AF,' Li = Lo -b AFj 
mi — mo — Am; = mo + Am; gi = go — Ag; ga = go -b Aq. 

Since So is a function of H, L, m and q, we may, by entering the subscripts of 
the combination HL-mq, tabulate the possible determinations of So as follows: 


11-11 

11-12 

11 21 

11-22 

12-11 

12-12 

12 21 

12-22 

21-11 

21-12 

21-21 - 

21-22 

22-11 

22-12 

22-21 

22-22 


In this tabulation, the subscripts of parameters are in the ord,er of HL-mq; 
so that 12 '21 denotes so{HxLi-mtqi), etc. 

From the table, it is seen that we may compute 2* = 16 distinct sets of approxi- 
mations to SoiflL-mq) Since the true values of H, L, m and q are unknown, 
each set of approximations s,' will show a characteristic variation about its mean 
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value, So . This variation is most conveniently measured by the mean deviation 

6 = (s„ - s',)2N':N = (s'o' - s,)2N"\N, (35) 

where the second relation serves as a check on the computation by the first; 
N = n - 2k-, N' denotes the number of items s, which are less than so in value, 
and N", the number of items s, which are greater than So in value ; while Sq denotes 
the mean of the N' values of s, which are less than So, and So^ the mean of the 
N" values of s, which are greater than so . 

The reliability of a given value of So as a measure of the central tendency of 
the corresponding set s, is sufficiently determined by e~^, which serves at the 
same time to measure the reliability of the combination HLmq figuring in the 
computation of the given set s, . We may, therefore, compute the values of the 
test constant, w, directly from the values of by means of the relation, 

u){HL-mq) = n + ^ii n + • • + £ 22 - 22 ] ~ (36) 

where j = 11 -11, 1M2, •• , 22-22; == 1. 

Since four values of theta are to be determined, we must arrange the sixteen 
values of omega in four ways, as shown by the following tabulation of subscripts : 


«{H0 


«(ii) 

wfi/j) 

u(wil) 


u(?l) 

£o(gj) 

11-11 

21-11 

11-11 

12-11 

11-11 

11-21 

11-11 

11-12 

11-12 

21-12 

11-12 

12-12 

11-12 

11-22 

11-21 

11-22 

11-21 

21-21 

11-21 

12-21 

12-11 

12-21 

12-11 

12-12 

11-22 

21 22 

11-22 

12-22 

12-12 

12-22 

12-21 

12-22 

12-11 

22-11 

21-11 

22-11 

21-11 

21-21 

21 11 

21-12 

12-12 

22-12 

21-12 

22-12 

21-12 

21-22 

21-21 

21-22 

12-21 

22-21 

21 21 

22 21 

22-11 

22-21 

22-11 

22-12 

12-22 

22-22 

21-22 

22-22 

22-12 

22-22 

22-21 

22-22 


Knowing the values of omega, we have at once. 


e{H) = MHi) - MHi)-, 

0{L) — '2u{Li) — Sw(Li); 


6{m} = 2w(to 2) — I)«(mi); 

e(q) = 2 «(e 2 ) - Sco(gri). 

(37) 

H = Ho + e{H)-AH-, 

H — Lo -(- ^(fv) - AT; 


m = mo + 6(m)-Avi; 

Q = Qo + e{q) -Aq. 

(38) 

The process of adjustment should be 

repeated until errors in the parameters 


diminish to negligible proportions. 

With H, L, m and q known to a sufficient approximation, we may form anew 
the functions GiH, L, m, q) and F{H, L, m, q). We can then write n - 2k 
equations of form (33), viz.: 


AlG, = sAlF,. 
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Summing these equations, we have, 

2AlG, = sS^lF,, (39) 

where XAfG, == 

2A|F. = - 2^lZr^-^F, + ; 

where k = n;3 to the nearest integer. 

The approximate value of s is now obtained from the relation 

s = (4o) 

Returning to equation (31), we solve for bkAt, obtaining, 

hkAt = AiG, - sAiFi. 

Since we can form n — k such equations, the approximate value of h is given 
by the relation 


b = [SAaG, — s1,AkF^]:[k{n — ft)Ai] 


(41) 


[CL\zVG, - S'lr-^G,) - - i:izr'‘~%)]:[k{n - k)At], 


where /c = n:2 to the nearest integer, 
from equation (29), we obtain the approximate value of a as follows: 

a = [LlZVG, - - s j:\Zr%] -n. 


(42) 


Comparing the abridged method of computing the values of theta here out- 
lined with the general procedure of section II, it will be seen that we have been 
able to reduce the number of values of omega which it is necessary to determine 
from 2^ = 128 to 2^ = 16. In cases where L may be assumed to equal zero, 
the number of values of omega which must be computed is further reduced to 
2 ** = 8 . 


* Party 

XI. Symmetric Parameters for the Population of the TTnited States. I 

have determined the numerical values of the parameters of both the sym- 
metric and the skew forms of the logistic from the population figures for the 
United States given by the Bureau of the Census. The only departure in the 
data from the census figures consists in the interpolation of all items to June 
1st as the date of observation. The values of the symmetric parameters are 
computed from the data of Table I, as follows. 

Setting fc = 154-3 = 5, we have, by equation (23), 

2o As log AsPZ^ = 2s log -^5^7^ ' 2^ log AsPT^ 

= 9.71878n - 5.14555n = -3.42677. 
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TABLE I 


Data for the Symmetric Logistic 


i 

p-i 

.AaP-^ 

logAsP ^ 

IQSt 

0 

0 25582 

-0.19724 

i 29500„ 

1.00000 

1 

0.18939 

-0 14627 

I.16516n 

0.72934 

2 

0 13886 

-0 10704 

i.02965„ 

0.53193 

3 

0.10431 

-0.07838 

2 89421„ 

0 38796 

4 

0.07770 

-0.05776 

2 76163„ 

0.28295 

5 

0 05858 

-0.04269 

2.63033„ 

0.20637 

6 

0.04312 

-0.02996 

2.47654n 

0.15051 

7 

0 03181 

-0 02098 

2.32181„ 

0 10978 

8 

0.02593 

-0.01650 

2.21748„ 

0.08006 

9 

0 01994 

-0.01182 

2.07262„ 

0.05839 

10 

0.01589 



0.04259 

11 

0.01316 



0.03106 

12 

0.01083 

• 


0.02265 

13 

0.00943 



0.01652 

14 

0.00812 



0 01205 

S 

1.00288 

-0.70864 

il.86433„ 

3.66216 





TABLE 11(a) 






Data for the Skew Logistic 



t 

ffi 


Oi 

Fii 

Pa 

Fn 

Fa 

0 

+ 1.67998 

+ 

1.71132 

6.47765 

3.36261 

9.65194 

4.90306 

1 

+ 1.54968 

+ 

1.57779 

5.68859 

2 60000 

8.45931 

3,73631 

2 

+ 1 40690 

+ 

1.43878 

4.90306 

1.88680 

7.26911 

2.60000 

3 

+ 1.27698 

+ 

1.30927 

4.12311 

1 28062 

6.08276 

1.56205 

4 

+ 1.14130 

+ 

1.17416 

3 35261 

1.00000 

4 90306 

1.00000 

5 

+ 1.00816 

+ 

1 04179 

2.60000 

1 28062 

3.73631 

1.56205 

6 

+ 0 85948 

+ 

0.89428 

1.88680 

1 88680 

2.60000 

2.60000 

7 

+ 0.70640 

+ 

0.74193 

1.28062 

2.60000 

1.56205 

3.73631 

8 

+ 0.69699 

+ 

0.63515 

1.00000 

3.35261 

1 00000 

4.90306 

9 

+ 0.44841 

+ 

0.48966 

1 28062 

4.12311 

1.56205 

6.08276 

10 

+ 0.30840 

+ 

0.35346 

1.88680 

4 90306 

2.60000 

7. 2691 i 

11 

+ 0 17992 


0.22981 

2.60000 

5.68859 

3 73631 

8.45931 

12 

+ 0.02885 

+ 

0.08647 

3.35261 

6.47765 

4.90306 

9.65194 

13 

- 0.09590 


0 02968 

4.12311 

7.26911 

6.08276 

10.84620 

14 

- 0 25808 

— 

0.17670 

4.90306 

8.06226 

7.26911 

12.04159 

2 

+10.83647 

+11.47739 

49.45864 

55.76384 

71.41783 

80.95375 
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TABLE 11(b) 

Data for the Shew Logistic 


1 

A^Gi 

A5(?2 

Asm'll 

Aji^ia 

A^fai 

As^’aa 

0 

-0.02794 

-0.01880 

3.16445 

5 69443 

4 77932 

9.04807 

1 

+0.01064 

+0 . 01904 

4.51499 

4.51499 

6.99562 

6.99562 

2 

+0 02495 

+0.04139 

5.69443 

3 16445 

9.04807 

4 77932 

3 

-0.01290 

+0 00929 

6 24622 

1 84451 

10 16552 

2.60213 

4 

-0 01360 

+0 01834 

5.69443 

0.81604 

9.04807 

0 87607 

S 

-0.01885 

+0.06926 

25 31452 

16 03442 

40.03660 

24 30121 


We note that k(n — 2k) At = 5(15-10)1 = 25; hence, 

6 = -3 42677 4- 25 = -0 1370708. 

Next, set fc = 15 4- 2 = 7, to the nearest integer; then, by equation (24), we 
get 

SoA^P;^ = - E’oP7‘ = 0.10380 - 0.86777 = -0.76447; 

B == 10*>*^^' - 1 = - 1 = -0.89022; So == 3.39884. 

Hence, 

A = log [-0 76447] - log [-0 89022 X 3.39884] = 1.4025324. 

We have next 

= 1.00288; SJ' 10*' = 3.66216; 10^ = 0.25266. 

By equation (25), then, we obtain 

C = [1.00288 - 0.25266 X 3.66216] - 5 - 15 = 0.0051747. 

By equation (26), we get 

= C-i = 193 25. 


Finally, by equation (27), we obtain 

a = A + log H = i 4025324 + 2.2861136 = 1.68865. 

The point of inflection of the curve is given by 

f, = _a:& = 1.68865 4- 0.1370708 = 12.319. 

XII. Skew Parameters for the Population, of the United States. Assuming 
X = 0, we form 

Hi = 198 0 - 7 0 = 191.0; H, = 198.0 + 7.0 = 205.0. 

OTi = 1 0 - 0.2 = 0 . 8 ; TOa = 1.0 + 0.2 = 1 . 2 . 

= —6.0 — 2,0 = -8.0; g'a = —6.0 + 2.0 = —4.0. 
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Next, the primary data of Tables 11(a) and 11(b) are computed. Setting 
fc = 15-^5 = 3, w — fc values of the 2^ sets of s, are determined and entered 
in Table III (a). The values of so , « and o> for each set are computed by equa- 
tions (34), (35) and (36). 

In Table Ill(b), the several values of to are arranged according to their associa- 
tion ; first, with Hi, Hi] second, with mi,nui] and, third, with qi,q 2 . The column 
sums 3 deld the weights 2co. The values of 9 and the adjusted values of param- 
eters are computed by equations (37) and (38) ; 


TABLE III(a) 

Data for the Computation of 0 


t 

S(I 11) 

s(1.12) 

»(1.21) 

«(! 22) 

«{2.11) 

s(2 12) 

s(2 21) 

s(2 22) 

0 



-0 00585 



IBq 

-0.00393 


1 



+0.00152 






2 



+0 00276 






3 



-0 00127 



BQR^ 



4 

-0 00239 

-0,01667 





QB 


S 


||||QR|R ! 

-0 00434 



+0.04161 



So 



-0 00087 





+0.00676 

€ 



+0,00241 





+0 00643 

(0 


lillB 

+0 25711 

EBm 

BB 

BH 

+0 00290 

+0 03061 


TABLE III(b) 


Data for the Computation of 0 




aiht) 

M(mi) 


"(Si) 



0 1062 

BBB 


0 2571 

mBM 



0 0301 



0 0492 

BbI 



0.2571 



0.3029 


0 0914 


0.0492 



0 0360 

0.3029 

0 0360 

2 

0.4426 

1 

0.6674 

0 3548 

1 

0 6462 

0 7933 

0 2067 


TABLE IV(a) 




Summary 

of Adjustments 


Parameter 

Eetimated 

Value 

A 

e 

A-e 

Adjusted 

Value 

H 

+198.0 

+7.0 

+0.1148 

+0.8036 

+198.80 

m 

+ 1.0 

+0.2 

+0 2904 

+0.05808 

+1.05808 

q 

- 6.0 

+2.0 

-0.5866 

-1.1732 

-7.1732 
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TABLE IV(b) 
Final Transformations 


i 

GiRmq) 

FtHmq) 

0 

1 69772 

7 65559 

1 

1.56410 

6,60800 

2 

1.42495 

5 46440 

3 

1.29526 

.4.52752 

4 

1.15991 

3.50336 

5 

1 02722 

2 50753 

6 

0.87921 

1 59408 

7 

0 72613 

1.01784 

8 

0.61866 

1.32865 

9 

0.47182 

2 17626 

10 

0.33408 

3 15374 

11 

0 20842 

4.17075 

12 

0 06189 

5.20418 

13 

1.94223 

6.24580 

14 

1.78913 

7.29229 

S 

11.20073 

62 54999 


Finally, the functions G and P are formed anew from the adjusted values of 
' H, m, q The adjusted values of s, b and a are computed by equations (40), 
(41) and (42), as follows: 

s = - 2ZIG. + SoG'J:[I:UF, - 2ZIF, + 

= [0.33574 - 2 X 3.72304 + 7.14194] s- [26.06676 - 2 X 8.62436 

4- 27.85887] 

= 0.03161 36,67691 = 0.00086185. 

i = - TIG, - - Ttmm - mt] 

= [1.42623 - 9.04837 - 0.00086185(29.57167 - 31.96048)] h- [7(15 7)1] 

= [-7.62214 - 0.00086185 X (-2.38881)] [56] = -0,13607. 

a = - Eo'*^ - 

= [11.20073 - (-0.13607 X 105) - 0 00086185 X 62 54999] 15 

= 1,69561. 

In the present case, the values of Hq, ma and ^'o were known within definite 
limits from previous experimentation. The values of the corrections, d-A, 
were, on this account, smaller than should ordinarily be expected from a first 
application of the technique. Always, it is necessary to take A sufficiently 
large to insure e < 1. As a preliminary step, it is not infrequently advantageous 
to compute trial values of e by holding constant each two of the parameters 
Ho, mq and qo while experimenting roughly with the third. 
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TABLE V(a) 
Ordznates of Fitted Curves 


Year 

Census 

Count 

Symmetric 

Ordinates 

Percentage 

Deviations 

Skew 

Ordinates 

Percentage 

Deviations 

1790 

3.909 

3.88 

-0 78 

3 87 

-0 01 

1800 

5.280 

5.28 

-0.03 

5.27 

-0 25 

1810 

7.202 

7 16 

-0.52 

7 15 

-0 73 

1820 

9 587 

9 69 

+ 1 07 

9 67 

+0 88 

1830 

12.866 

13 04 

+ 1 37 

13.02 

+1 20 

1840 

17 069 

17 45 

+2 22 

17 42 

+2 09 

1850 

23 192 

23 15 

-0 20 

23.13 

-0 28 

1860 

31.443 

30 38 

-3 36 

30 37 

-3 42 

1870 

38.558 

39.36 

+2 09 

39 31 

+ 1.95 

1880 

50.156 

50.18 

+0.05 

50 07 

-0 18 

1890 

62 948 

62.61 

-0 31 

62.60 

-0.55 

1900 

75.995 

'76 79 

+1.05 

76.64 

+0.86 

1910 

92.329 

91.76 

-0 62 

91 72 

-0.67 

1920 

106 001 

106 96 

+0.90 

107.16 

+1.09 

1930 

123.068 

121.66 

-1.14 

122 23 

-0 69 





TABLE V(b) 







Extrapolations 




Year 

Forecast 

Sym. 0. 

Sk.O 

Year 

Sym 0. 

Sk 0 

1940 

137.20 

136 22 

136.26 

1780 

2.844 

2.850 

1950 

149.29 

147.18 

148.78 

1770 

2 083 

2.095 

1960 

159 88 

157.33 

159.52 

1760 

1 523 

1 539 

1970 

168 71 

165.66 

168.42 

1750 

1.113 

1.130 

1980 

175.83 

172.33 

175 59 

1740 

0 813 

0 829 

1990 

181.46 

177 . 52 

181 25 

1730 

0 594 

0.608 

2000 

185 82 

181 52 

185.63 

1720 

0 434 

0 445 

2010 

189.14 

184.55 

188.98 

1710 

0.316 

0.280 

2020 

193 . 11 

186 82 

192.97 

1700 

0 231 

0 238 

2030 

198 54 

188.52 

193 40 

1690 

0.168 

0 173 

2040 

194.94 

189.77 

194.83 

1680 

0.123 

0 127 

2050 

195.98 

190 72 

195 87 

1620 

0.090 

0.092 

2060 

196.75 

191.39 

196.64 

1610 

0 065 

0 067 

2070 

197 31 

191 88 

197.22 

1600 

0.048 

0.049 

2080 

197 73 

192.25 

197.64 

1590 

0 035 

0 036 

2090 

198 03 

192.52 

197.94 




2100 

198 25 


198.17 




2110 

198.42 


198.34 




2120 

198.54 


198 46 




2130 

198 63 


198 55 
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Part VI 

XIII. General Considerations. The technique of solution for the numeri- 
cal values of parameters presented in the foregoing pages is generally applicable 
to continuous functions of real variables. The abridged procedure may be 
followed whenever the given function involves a component which is linear in 
certain of the parameters; for, in such cases, it is always possible to effect a 
transformation of ordinates which will permit of the elimination of the param- 
eters of the linear component. In any event, the equation of the function 
may be solved for a single parameter which may then be employed, as m our 
illustration, as a means of determining the values of the test constant, omega. 

XIV. An Interpretation of Results. The equations of the symmetric and 
skew logistic curves as computed for the population of the United States are, 
written to the natural base, as follows : 

V = 193.25 ;[1 -f 3 ib62,]_ 

p ~ 198.80 :[1 -f- omssisvi+l 068!»(1~7.1732)1] 

The amount of skewness in the second of these equations, as measured by 
the value of s, is small; but, owing to the fair size of the parameter m, it de- 
velops rapidly and affects the form of the curve sensibly. The major effect 
is to raise the value of the limiting population as given in the first equation by 
about six millions and to prolong the period of growth by about forty years. 
The approximate limit of 193 millions in the symmetric form is reached about 
the year 2090; while the approximate limit of 199 millions of the skew form 
’ is not arrived at until about the year 2130. 

The positive sign of s makes for a decreasing acceleration of the rate of in- 
crease during the earlier phases of growth and for an increasing retardation of 
this rate durmg the later phases, the value of q fixing the point of transition 
in the year 1861. This general epoch has often been cited by sociologists as 
marking the shift from a dominantly rural-agricultural civilization to a domi- 
nantly ,urban-industrial one. The point at which the change takes place has, 
to my knowledge, never before been defined mathematically. 

Both curves fit the observations excellently, as shown by the percentage 
deviations of Table V(a). The forecasted growth presented in Table V(b) 
is based on the skew ordinates, the formula being 

Pi = (43) 

where P denotes the actual population senes, observed or predicted, and p, 
the skew ordinates. The assumptions of the formula are two: first, that it 
is the observed population Pu which initiates the forecasted series; and, second, 
that the influence of the correction factor Pu/pu diminishes with the time. 

The extrapolations of both the skew and symmetric formulas contrast with 
the results obtained by Doctors Dublin and Lotka, who predict a stationary 
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population of 150 millions by 1970 For the same year, the ordinates of both 
the skew and symmetric curves exceed this figure, the one by 15.66, and the 
other by 18.42 millions 

The limit of 150 millions referred to was arrived at by analysis of current 
tendencies in birth and death rates. The argument is that current birth rates 
are spuriously high and current death rates spuriously low because of the 
abnormally high proportion of men and women in the reproductive ages. This 
circumstance is due, m part, to the influx in the past of immigrants from com- 
munities having a high normal birth rate, and, in part, to the high birth rates 
of preceding generations of parents in this country. 

After computation of the necessary corrections has been made, the true rate 
of natural increase of the white population for the registration area of the United 
States for the year 1920 is seen to be only about 5.4 per thousand instead of the 
10 7 per thousand indicated by the crude rates. For the year 1930, the actual 
rate of increase is 7.5 pec thousand; while the corrected or true rate turns out 
to be virtually zero. Under the interpretation of the authoritie.s cited, the 
spurious excess of births over deaths will be entirely dissipated by the year 
1970, with the result of the stationary population predicted. 

The hazard peculiar to this method of inference arises from two assumptions 
that are made . first, that the present collection and registration of vital statis- 
tics is sufficiently reliable to make precise estimate of the true rate of natural 
increase possible; second, that the tendencies of fecundity and mortality ex- 
hibited by current data are stable. 

With respect to the first assumption, the authors have this to say: 

“One factor of safety of unknown magnitude remains. There is still some 
degree of laxity in the registration of births, and the figures of the true rate of 
natural increase may, on that account, be somewhat larger than recorded 
above.” 

The caution of the author.s in this statement is in contrast with the uncritical 
acceptance of their results by those who fail to grasp the implications of 
technique. 

Concerning the second assumption, it may be pointed out that many of the 
tendencies exhibited by current data must be regarded as statistically re- 
versible. Falling birth rates due to drift of population to cities, to postpone- 
ment of marriage on the part of professional classes, to the increasing cost of 
child culture, to the ubanization of rural life and to the restriction of immigra- 
tion may be definitely altered by reversals in tendency. The flow of popu- 
lation may move into extraurban and subrural districts, where birth rates are 
more favorable to increase. The cost of child culture may, in part, be socially 
assumed. Improvement in economic conditions may lessen the dram on the 
resources of the family. The tendency for rural birth rate,s to fall may be 
checked. Immigration may increase with improving economic conditions. 
Death rates may be further reduced in many age classes and for many causes. 

In fine, when we attempt to project into the future the components that 
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determine the trend of natural increase, we encounter risks which vastly exceed 
those involved in the projection of the population series itself. Most of the 
data from which component trends must be determined cover but a brief period 
of time ; while population data extends back for a century and a half. In this 
connection, it is not impertinent to inquire the criterion of relevance that will 
warrant a rejection of the items of the very series we are seeking to forecast. 

It is a cardinal principle of logistic theory that the growth of population 
depends primarily on the continued supply of basic resources, physical and 
social, and that the dissipation of these resources is registered in the growth 
rate of the population itsMf. Any tendency of a population series toward 
skewness, that is, toward departure from the symmetric type of growth, is 
more likely to persist if it is systematic in character. The skew forms of the 
logistic function which we have developed permit us to measure any existing 
systematic tendency of the data toward skewness, and, therefore, to improve 
on the symmetric expectation of future growth. 

In the case of the United States population, the evidence of skewness, insofar 
as it bears on the problem of expectation, is adverse to the conclusion that the 
ultimate limit of growth will be less than the symmetric asymptote. Conceding 
the light that the analysis of current tendencies may throw on the probable 
occurrence of future deviations from trend, the best criterion of long-time 
growth remains the logistic projection. 

This statement, to be sure, does not relieve us of the necessity for recognizing 
the nature of the hazard that inheres in making a prediction from a trend 
extrapolation. The hazard involved in this type of inference arises from the 
assumption that the basic conditions of growth are stable, or, in other words, 
that the values of the parameters of the forecasting formula will remain sub- 
stantially unchanged with the inclusion of new observations. Time alone can 
provide the final test of the contmued validity of this assumption. 

XV. The Law of Organic Growth. The law of organic growth in its most 
general form may be written : 

p = L -f H;[l -b (44) 

where Ui = -p q)]] = log[l + m\t -p qY]-, = Vl + rn^ii + qY- 

For most practical purposes, the evaluation of thirteen parameters is out of 
the question; hence, the restricted forms a, 13, and y, equation (18), will be the 
ones most generally employed. 

I have made use of the term law of organic growth with reference to the logistic 
forms developed because I believe these functions to be the best means yet 
devised for the representation of the sequential changes which living organisms 
regularly manifest as individuals or societies. It states, in a quantitative form, 
all that is qualitatively implied by the so-called “law of diminishing returns” as 
this is commonly invoked by economists. The special sense in which I have 
used the term law may be expressed as follows: 
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A sioitishcal law is a watheinaiical genevalization on the hehamov of a system of 
observations such that the implications of the formula are in accord with the assump- 
tions basic to the phenomenon observed, and such that evaluations of the parameters 
of the formula determined from random samples are mutually consistent. 

A statistical law, then, posits a system of relations manifesting itself in the 
form of observations which must be subjected to analysis before the true nature 
of their interrelations can be inferred. It expresses a probable, rather than a 
certain, inference; but, within the limitations of its claim to precision, it leaves 
reason no more free to reject its specification of reality than does a law of 
mechanics. Indeed, the point is still in dispute as to whether any law of science 
can be more than a statement of probabilities. 

In contradistinction, the term emptncal formula is properly restricted to cover 
the representation of the single set of observations at hand, and bears no neces- 
sary relation to any larger system. A sufficient test of an empirical formula is, 
therefore, the test of fit. 

We may fit an indefinite number of formulas to a population series and obtain 
satisfactory results so far as agreement is concerned; but, on extrapolating, the 
same formulas will yield results that are patently absurd. The backward 
extrapolation for the population of the United States shown in Table V(b) 
represents the known facts as closely as could be expected when we take into 
consideration that census enumerations include aboriginal and immigrant 
populations as well as native born. Certainly, no random empirical formula, 
selected on the ground of goodness of fit, could be expected to yield as satis- 
factory a result. 

Logistic theory does not, then, profess to guarantee infallibility of prediction. 
A population is not a mere aggregate of unrelated individuals inhabiting a 
restricted area, but a unified organization which grows by the utilization of 
total resources. When the supply of resources is profoundly disturbed or the 
basis of organizational unity destroyed, then the basis of prediction also is 
destroyed. And such reasoning is by no means peculiar to the sphere of social 
organization; for the integrity of any purely mechanical system is likewise 
conditioned by the assumption that the basis of coherence persists. 

At this point, those in whom the speculative disposition is strong may query: 
if statistical prediction does not yield a certain result, is it, in the final analysis, 
superior to the ready and far less expensive method of guessing? 

In answer, I can only say that, a posteriori, we can always, among a sufficiently 
large I batch of guessers, find someone who has guessed well; but how, a priori, 
are we to know the good guesser from the poor? A population series consists 
ofv definite magnitudes, and any prediction of its development must result in 
the selection, out of a vast array of possible magnitudes, that which is most 
consistent with all the known facts. The gambler may elect to hazard his 
stake on the result of a random estimate; but the prudent will give heed to the 
exacting, if laborious, procedure of mathematical analysis. 
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ADDENDUM 

Another solution of the theoretical problem stated in Section I may here be 
noted. 

Given, as before, the function y — f(x, a,i ■ ■ ■ ), we may, by assigning three 
approximate values to each parameter, compute 3" sets of values for the function 
y, thus. 

yn = /(a;, aibi • • ); aibi • • • ); Via = (x, aA ■ ■ ); etc. 

From the observations Y, we may compute 3^ .sets of the residuals y — Y; 
and from these several seta of residuals, the corresponding standard errors of 
estimate, a, may be computed for each set of values of the function y, thus, we 
have: 

ffji = 0(F, X, aibi) 
ffii = tp(.Y } ic, aibz) 

fl-13 = <t>(Y, X, aihs) 

Restricting the parameters to a, h, and holding a constant, we observe that 
the values must vary with the assigned values of the parameter b, 

and take a minimum value when h takes its true or most probable value. As the 
errors in the approximation to b increase positively and negatively without limit, 
the computed values of <r^ will tend toward the infinite. They may, therefore, 
be assumed to lie on the arc of a parabola whose equation is a quadratic function 
of Xttib; hence, we may form the following equations of representation: 

ffi 1 = &11 -p + Wiidi . 

ffl 2 = kl 2 + ^12®! 4" ^^12®! . 

Xi3 = ki3 -p haO'i + niisR, . 

By addition, we have, 

O'!! + ®'l2 + O'ia = ^11 + ^12 + ^13 + (I'll + ^12 + ^is)®! + (Wu + 2Rl2 + . 

®y appropriate variations in subscript, similar equations may be written in 

02 and 03, thus: 

‘^2 1 + <^22 +‘^23 — ^21 "p lC22 -p ^23 .-p (I21 -p i^22 "P l 2 s ) Cl 2 "p (21121 "p 11122 "p 11123) O 2 • 

o'3 1 ®'3 2 d" <13 3 = ^31 4" ^32 + ^33 + {hi “p ha 4" ^33)<i3 “p (nisi -p Vin 4" 11133)03 • 

These three equations are all of the quadratic form, and may be conveniently 
written as follows : 

Ai = JK-x -p JhiOi 4~ .il^iOj . 

A .2 — JCl -p Z/1Q2 ~p Afl02 • 

A3 = Ki -p Litti -p AfiOj . 
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By precisely similar reasoning, the following equations in b may be developed; 

■Bi — K2 •f' LJ)i Mih\ . 

B2 — K2 L^2 + Mibl , 

B3 = K2 -|- Bibs -b Mjbs ) 

where 

5l = Vll + Vj! + O-ji ; Sj = 0-12 -|- 0-22 + ; -63 = dia + ff 23 + <133 . 

Since the values of ai , a2 , as and 61 , 62 , ha arc assigned, the two sets of equations 
may each be simultaneously solved to obtain values f or iCi , Li , Mi and Ki , L2 , M2 . 
To obtain the conditions for A = a minimum, B - a minimum, we differentiate 
with respect to a and 6, as follows: 

DaiA) = Li -t- 2 Mia; DsiB) = L2 + 2 MJ). 

Setting these two equations equal to zero and solving, we obtain the adjusted 
values of a and b, thus : 

a “ — Bi ; 2 M'i j fe = — Bs : 2M2 , 

The extension of this method to the case of p parameters is obvious. Assign- 
ing three approximations to each parameter, we hold constant a value of one 
parameter (say ai), we form all possible combinations of subscripts for the 
remaining parameters (616263 with CiC2C3 with etc.). This will yield 3 *’“^ values 
of <r^ each of which is associated with Ci . Repeating this process, we can form 
similar sets of values of <r^ by association with and as . We can then form the 
sums = <r(Yxaibc ■••)•, As = <r{Yxa^c •■•); As = aiYxusbc • ■ ). In all, 
3 X 3 ^“^ or 3 ” distinct determinations of <r^ will be required. In like manner, 
the equations for Ri , R2, Bs and Ct, C2, Cs, etc. are'formed. The solutions for the 
adjusted values a,b,c, • • • follow directly. 

Since the method of solution given in Part I requires the computation of but 
2 ^ values of cr"^, it is evident that the method of this section is the more onerous 
when considering the determination of a single set of adjusted values of param- 
eters, the excess being of the order 3 ’’: 2 ’’ = ( 1 . 5 )p. However, being more pre- 
cise, the present method will require fewer approximations to arrive at satisfac- 
tory values of the parameters sought. In other words, the mathematical 
advantage of economy lies with the theta technique; while the advantage of 
precision lies with the quadratic technique, 
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ON A METHOD FOR EVALUATING THE MOMENTS OF A 
BERNOULLI DISTRIBUTION^ 

By Everett H Largxjier, S.J. 


1. The moments (per unit frequency) of a frequency distribution have long 
been regarded as useful characteristics of the distribution. If we denote the 
moment about the arithmetic mean by n, we have for the Bernoulli distribution 

Tj 

x=‘0 

pxqri-x 


where x = x — np and f(x) - 


To evaluate the s-th moment about the arithmetic mean has always been a 
laborious task. Karl Pearson^ gave the s-th moment about the arithmetic 
mean as, 


( 1 ) 


IXi = 


_ [geP® -f- 


Xi=0 


which he said at that time was perhaps the easiest expression for obtaining these 
moment coefficients by successive differentiation. Romanovsky,® however, 
was able to develop the recursion formula, 


( 2 ) 


/^s+i = Pi 


-f 


diii 


for the moments about the mean. Another relation for these moments is 


( 3 ) 


B—l 

j = 0 



[npqp, - 


Recently Kirkham^ gave the expressions for the first eight moments which, 
however, are not in a form well adapted for numerical calculation on a machine. 


I Presented to the American Mathematical Society, January 2, 1936. 

= Karl Pearson, Biomeirika, vol. 12 (1918-1919), footnote, p. 270, This expression is 
obtained from the moment-generating function.' > Obviously this method is exceedingly 
impractical for numerical calculations. 

® V. Romanovsky, "Note on the moments of the binomial (p ■+• (?)” about its mean," 
Biometriha, vol 15 (1923). Recently this expression was given a simple proof by A, T, 
Craig {Bull. Amer Math Soc., vol. 40, pp. 262-264) and extended to the Poisson case 

‘ 'W. J Kirkham, "Moments about the arithmetic mean of a binomial frequency distri- 
bution," Annals of Mathematical Statistics, vol. VI, pp 96-101. 
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2. It is the purpose of this paper to express the s-th moment about the 
arithmetic mean in the form 

(4) S F,An)p‘, 

t=*l 

where F.,«(n) are determinable functions of n dependent on s and t. We note 
here that p and q are the probabilities of the success and failure of an event in 
a single trial 

Since we know that m = npq and jlu = 0, it is evident that the part of (2) 
enclosed in [ ] will be of degree 2 less than s + 1 in p and hence (4) will satisfy 
as a representation of the moment. 

3. To obtain a recursion formula for the functions F,.t{n) we differentiate (4) 
with respect to p. This gives 


dp 


22 tF,,t{n)p' 


t-1 




By (2) we may then write 

Z F,+u(n)p‘ = p(l - p)ns F,.iAn)p‘ + p(l - p) 'll tF,,t(n)p‘-^ 

/"I 

8 

= ns 22 F,-ut-i(n)p* — ns 22 F ,^i,t-i{n)p* 

+ 52 iF,An)p^ - £ (< ~ l)^’..j_i(n)p‘. 
{-1 (-8 

Since this is an identity in p, we have immediately the following recursion 
formula for determining F,,t(n): 

Fs,t(.n) = n(s - l)F 8 _ 2 ,(_i(n) - n(s - l)Fs_ 2 .t- 2 (n) + «F._i.((n) 

(5) 

— {t ~ i)F a-i,t~iin) 

in which 

it > s; 

(6) Fofiin) ~ 1; and F^An) = 0 for < 1, s > 0; 

[t = l,s = l. 

These definitions arise from the known values of the moments and the condi- 
tions imposed by the identity in p. 

By means of (5) and (6) we are able to obtain very readily the values for 
F..((n) which are given in Table 1. 
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TABLE I 
Values of F,.t(n) 


8 



PiM 


1 

0 

0 

0 

0 

2 

n 


0 

0 

3 

n 

—Zn 

In 

0 

4 

n 

~7n + 3n® 

12n -- 6n^ 

-6n + 3w* 

5 

n 

-15n + lOn^ 

50n — 40ft* 

-60ft + 50ft* 

6 

n 

-31n + 26n2 

180n - 180ft* + 15ft* 

-390n + 415ft* - 45ft* 


n 

— 63n + 

602ft - 686ft* + 105ft* 

-2100ft + 2590ft*-525ft* 


n 

“127n + 119n2 

1932ft - 2394 m* + 490ft’ 

-10206ft + 13895ft* 

-3850ft* + 105n‘ 


■ 


F^.aCn) 

1 

0 

0 

2 

0 

0 

3 

0 

0 

4 

0 

0 

5 

24« - 20ft* 

0 

6 

360ft - 390ft* + 45ft* 

-120ft + 130ft* - 15ft* 

7 

3360ft - 4270n* + 945ft* 

-2520ft + 3234ft* - 735ft* 

8 

25200n - 35700tt* + 10990n* - 420n* 

-31920ft + 46004n* - 14770ft* + 630n* 


s 

E.,7(n) 


1 

0 

0 

2 

0 

0 

3 

0 

0 

4 

0 

0 

5 

0 

0 

6 

0 

0 

7 

720ft - 924ft* + 210ft’ 

0 

8 

20160ft - 29232n* + 9520ft* - 420ft* 

-5040ft + 7308n* - 2380ft* + 105n* 


With this table it is a relatively easy task to evaluate the first eight moments 
with the aid of a calculating machine. 

4. As an illustration of the preceding we propose to evaluate tKe first eight 
moments about the arithmetic mean for the binomial, (.06785 -f .93215)^’* 
We first evaluate the coefficients Fs,tin). 
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TABLE IP 
Values of iPs,i(37S) 
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TABLE III 
Values of p‘ F,,t{378) 


8 

2 

3 

4 

6 

pF..i(378) 

25 6473 

25 6473 

25 6473 

25 6473 

p^F..i(,37S) 

-1 7401693 

-5.2205079 

1961 17087 

6651.73743 

3(378) 

0. 

.2361410 

-266 36702 

-1779 32225 

p-'E,.i(378) 

0. 

0 

9 03650 

150 92880 

p6E„B(378) 

0 

0. 

0 

-4.09621 

p'>^3,«(378) 

0 

0. 

0. 

0 

p^F., 1(378) 

0 

0 

0 

0. 

p8E,.B(378) 

0 

0. 

0 

0. 

Ms 

23 9071307 

20 6629331 

1729 48765 

4944 89507 

Q 

Q 

7 

8 


pF, i(378) 

25.647 

25 65 

25,. 6 


p>F..,(378) 

16390.655 

36726 27 

78055.3 


p^F,.,(378) 

245043.490 

1740844.73 

8159870 3 


p*E,.d37S) 

-50255 924 

-593117 96 

41066448.9 


p»E„s(378) 

3414 985 

72517 81 

-11483860 3 


p«E.,(,(378) 

-77 236 

-3828 14 

1177702 2 


piE.,r(378) 

0 

74 21 

-53386 8 


p8E,.s(378) 


0. 

906.6 


Ml 


1253242 57 

38945760 8 



This gives us the desired moments about the arithmetic mean of the binomial 
(.06785 + .93215)*’*. These values may be rapidly checked by applying (3) 
to PS' 


Saint Louis University, 
Saint Louis, Missouri. 





A METHOD OF DETERMINING THE REGRESSION CURVE WHEN 
THE MARGINAL DISTRIBUTION IS OF THE NORMAL 
LOGARITHMIC TYPE 


By Carl-Erik Quensel 

Assistant at the Statistical Institute of the University of Lund, Sweden 


In a paper' in this Journal Professor S. D. Wieksell gave the general ohtlines 
of a new method of calculating the regression lines. This problem was later 
on treated in detail by Dr Walter Andersson,^ His method was to develop 
the formulas for the regression lines into a series of orthogonal polynomials 
under the assumption that the marginal distribution of the independent 
variate belonged to certain mathematically defined distributions, and to de- 
termine the constants with the aid of the method of the least squares. 

Among other cases he treated also the case where the marginal distribution 
was of the normal logarithmic type: 


( 1 ) 


F(x) = 


loge 


<ri \/2ir (x - a) 


-in 

e 2 L 


log {x-a)-r 


But as his method is entirely different from the method I shall give here, I 
will not go any further into the method used by Dr. Andersson. 

When the correlation surface F{x, y) of the variates x and y is given and then 
of course also the marginal distribution of *, F{x), it is known that the mean 
yx of the dependent variate y in an infinitely small array with the value of x 
between x and x + dxia given as a function of the independent variate x by 
the following formula (2) 



In this formula the integrals are to be extended over the whole domain of 
the variation of y. 

If now we make any transformation of x by introducing a new variate u, 
related to x by the formula u = ^(a), where we must suppose that u igva one- 
valued function of x and contrary, the distribution f(u, y) of the variate u and 
y is given by the relation 

(3) /(w, y) du dy — F(x, y) dx dy 


^ S. D, Wieksell, Remarks on Regression. Annals of Mathematical Statistics, 1930. 
2 Walter Andersson Researches into the theory of Regression. Meddelande ftkn 
Lunds Astronomiska Observatorium. Ser 11. N:rW 
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Writing the formula (2) in the following form; 

/ yPi.x,y)dxdij 

y^ == ■L- ^ 

/ Fix, y) dx dy 


we see at once that the mean yx can be given as the following function of u: 


(4) 


2/=: = 


j yf{%y)dy 
j f(u, y) dy 


This relation, of course, is self-evident The mean of the dependent variate 
in an array of the, independent variate will be unchanged, when we change the 
variate x for another variate u, related to a; by a one-valued function. 

The problem of finding the regression line of the mean yx can in such a way 
be much simplified, if it is possible to make a favorable transformation of the 
independent variate x. 

As shown by Professor WickselP we may, under certain conditions concerning 
the marginal distribution fiu), write the expression of the regression line in 
the following form: 


( 5 ) 


yx 


a? 

- s <-«• 

0 


x„.i . 

n! fiu) ’ 


where the \„.i coefficients are the .seminvariants of the distribution of u and y. 
The conditions which the function /(w) must satisfy are among others that 
the function and all its derivates are continuous in the domain of variation and 
that the function and its derivates disappear in the limits of that domain. 
These conditions are satisfied by the normal curve of error. 

In the case where the distribution of u is normal, the derivates take 
the following form; 

(6) = (-l)".&„(w)/(u); 

where the expressions H„iu) are the well known Hermitian polynomials 
The formula (5) takes the following simple form. 

(7) 

0 

If we can change the given marginal distribution Fix) by a favorable substi- 
tution u = f^{x) into a normal cuiwe, and if, this substitution made, we can 


* S. D. Wicksell. Analytical Theory of Eegression. Meddelande frin Lunds Astrono- 
miska Obaervatonum. Ser II Nt69. 
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calculate the coefficients Xn.i from the moments or other known characteristics 
of the given correlation distribution, F{x, y), it is possible to express the regres- 
sion line as the formula (8) shows : 


( 8 ) 1 /. = 


It must be observed that the polynomials Hnl'Pix)] are orthogonal with regard 
to the distribution Fix^ of the independent variate x We have 

j HMx)] H, [Hx)]Fix) dx = I H,iu) H,{u)f{u) du = 0 iV j 

Not in all cases it will perhaps be possible to calculate the X„,i coefficients, 
when we have transformed the marginal distribution into the normal curve, 
but in one case it is rather simple to calculate these coefficients from the mo- 
ments given. 

The case alluded to is the one, where the variate u is given from x by the 
relation u = log(a: — a), that is that the marginal distribution is of the so 
called normal logarithmic type (1). 

In that case it is possible to calculate the X„,i coefficients from the marginal 
moments y„,o and from the correlation moments of the type Vn,i. 

We suppose that the marginal distribution is of the logarithmic type and 
that from the moments of the x distribution we have determined the three 
constants a, xi and I in the usual manner .■* 

Then we calculate from the given correlation distribution the moments F^,o 
about the point x = a and the correlation moments about the point x = a 
and y = my (the mean value of the y-variate). 

From these moments it is possible to calculate the X„,i coefficients in the 
following way. 

The characteristic function of u and y is given by the following relation: 

(9) Uikh) == e ' * = ■/ / fiu, y) dv, dy 


where the integrals are extended over the whole domain of variation. _ 

If the distribution of u is according to the normal law, we have X^.o = 0 for 
fc 3, but in the calculations here it is not at all necessary to suppose anything 
about these higher seminvanants. On the other side, the correlation distri- 
bution f{u, y) is obtained from the characteristic function by the inversion 
theorem. 


( 10 ) 


/(w, y) 


( 2 ' 




S riri b * “b’ 


* How these are to be determined is shown in Pae- Tsi Yuan. On the logarithmic 
Frequency distribution and the Semi-Loganthmic Correlation Surface. Annals of Mathe- 
' matioal Statistics, 1933. 
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But we can also get the following relation 

(11) I y)du = lj e— 

Of this last expression (11) between the characteristic function and the 
distribution function I will make use in the following. 

The moments V-, of the distribution F(x, y) about the point x ^ a, y = 
are given by the formula 

(12) V'tj = y J ®)’(2/ - y) dx Ay. 

If we write y instead at y — and instead of a: — a we write e^“ (t> = jolog e) 
the expression (12) takes the following form: 


(13) 



e’‘“y»7(M, y) du dy 


For the marginal moments of x about the point x = avre get 


(14) 




(x — a)”F(x) dx 


_ j 


du 


Comparing this formula (14) with the expression for the characteristic function 
of the distribution /(w) 


V ^k,0 . h 


(15) t/(«i) = j e^"‘f{u)du = 

we find the following simple relation 

(16) K.o = e 
For the moments of the type we get 

(17) Vn,i= J J e”’‘'^yf(u,y)dudy~ j ydy j e*^'^f{u,y) du. 

If we compare the last integral in the formula (17) J y)du with the 


formula (11) we see that we can write (17) as follows: 

(18) r;,. = i / »*/ 


■tw,yg hM\ 


From the sum (nbYiivhY we may take out the part (nb)*, 



200 


CARL-EEIK QTJENBEL 


where I is zero and which therefore does not contain any dignity of twj, and 
write the remainder in the following form: 




where we have 


= Xiinb + ^ inhy + ^ {nbY 

I = ^ + If (»w> 


The integral 


If 

2rJ 


. tliluV -2 


Xi 






may be considered as a frequency distribu- 


tion <9 (y) with the semin variants X^. 

The formula (18) will thus be written 

(19) F1,i = J ydy^iy) 

According to (16) we have 




and 'as 


we get 
( 20 ) 
or 


/ 


( 21 ) 


ydyipiy) = x^ = Xnnb + inhY + ^ {nhY ■ ■ . 


K.i = K.o-x; 


l^^ = \,nh+^{nhY+^inby 

^n,0 


We see that in the formulas for V'„^i we have all the seminvariants X„,i in- 
volved. A successive determination of the seminvariants X„,i with the aid 
of the moments of the same and lower degree is therefore not possible. 

However, when we use the formula (8) for the regression, wc must suppose 
that the seminvariants X„,i with growing n converge rather soon towards zero. 

/V' A V\ 

If the successive differences A" I J of the quotients — ^ are calculated, it 

V^n.o/ ^n,0 

may be possible to judge, how far it is possible to go with success. These 
difierences will in most cases diminish rather soon and we shall therefore in 
most cases get a value of n about which we can suppose that the differences of 
higher order than this will all be so small that they can be neglected and as a 
consequence of this fact all higher seminvariants can be neglected too. 
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When this value of n has been determined, the n first seminvariants will 

V' 1 

all be obtained from the n first quotients 

* n,0 

Thus we finally get the regression line as follows; 


— my ^ ^ H.Ilog (a: - a) — I] 


or in standardized units; 


n i-. 

X. 1 „ . 


log {x — a) — I 



THE STANDARD ERROR OF A “SOCIAL FORCE” 
By Stuart C. Dodd 
I. Definitions 


In the theory of measurement of social forces certain special cases of frequent 
occurrence where the population shifts from one date of measurement to the 
next require the derivation of appropriate standard error formulae. 

The theory may be briefly restated^ in equations as follows; any measurable 
social change, 0, in a population, P, may be defined as the difference in mean 
scores, 8, from surveys or measurements on the dates denoted by subscripts 

C 2 „i = 82-81^'^ ( 1 ) 


The momentum of a social change may be defined as the product of its time 
rate in years and the population that is being changed 

Ma-i = PV 2-.1 (2) 


= ^ (S, - so (2a) 

1 2-1 I 2-1 

where P’ 2 _i is the period from date I to date 2 and V is the velocity, or speed 

of change, in that period. The acceleration of a social change is definable as 

the rate of change of the velocity of change 


74-3 - V^-t 
.57 ( 4 - 3 - 2 + 1 ) , 


(3) 


where each velocity, being an average for its period, is taken as representing 
the mid-date of that period. 

The resultant social force which produces a measured change is now definable 
as that which accelerates the change in a population. It is measurable as the 
product of the acceleration and the population.^ 


F = AP 




(4) 

- P 1 

f Si 

Si 

Si \ 

(5) 

.57(4-3-2+1) 

^72-1 

72-1 



1 A GonlrolUd Experitnent on Rural Hygiene in Syria, Dodd, S, C., Publications of the 
American University of Beirut, Syria, Social Science Series No 7, 1934, pp. 336. 

Also, A Theory for the ifeaeurement of Some Social Forces, Dodd, S. C , Scientific 
Monthly, Vol. XLIII, No. July 1936, pp 58-62 

’ Force thus defined in terms of its effect is a resultant force, i.e. , the residual force after 
deducting all resisting forces from the total force in the direction of the change observed. 
This formula defines quantitatively and exactly the “net” force not the “grossl’ force 
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II. The Sampling error of one case (momentum) 

The formulae .for the standard errors of sampling for the above concepts, 
social change, velocity, momentum, acceleration and force, (C, F, ilf, A, and F) 
have been published for the case where the population, P, is the same on all 
dates of measurement. But it is not always possible to observe the ideal ex- 
perimental technic of holding the population unchanged in number nor to select 
out individuals common to all the surveys and to neglect the rest. Ordinarily 
there will be different P’s, Pi, P 2 , Pa, and P,, at the different dates. 

To derive the standard errors of (2) and (4) when P shifts, each P is con- 
sidered to be a sub-sample^ of the main sample which is (Pi -|- Pj .j- Pa -j- P 4 ). 
The orthodox view of sampling is taken where the sub-samples may differ in 
size but maintain fixed proportions in each main sample which is drawn from 
the ''parent” population. 

Let primes denote anM, or other function of ( 1 ) to (5), which is an approxi- 
mation due to the shifting of the population and the use of an average P. 

To simplify and generalize the notation, let k denote the constant term com- 
pounded of P’s and T’s which is associated with each S. The first subscript 
of k denotes the function, f, which is any particular one of the left hand members 
of equations (1) to (5) and the second subscript denotes the date of its S. Thus, 
from ( 2 a) 


,, - Pi + P 2 ,, 

■" OV 

Al 2-1 

( 6 ) 

Then (2) may be rewritten: 


ikf2_i = Si kui -j- S 2 km 

(7) 

= E Skm 
, 1 

(7a) 

To derive the standard error of (7) the total differential is: 


dM2-i = kMi d + kM2 d 

( 8 ) 


If Qiji denotes the population common to both dates of measurement so that : 


Pi = Qu -b Qi 

P 2 = Qi 2 d" ^2 


(9) 


producing the change. It thus measures only the obseTvdble part of the total forces in the 
situation. The fundamental problem remains, as always in science, to observe more 
adequately, to devise experimental and atatistical technics for measuring the different 
forces (in isolation and in combinations) which facilitate or resist the measured change. 

» The author is indebted to Mr. S. S. Wilks (Princeton) for this method of deriving these 
standard errors in a fluctuating population. 
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and, since the differential of a sum is the sum of the differentials of the several 
terms, ( 8 ) becomes 


<uf u = ^‘ (S to + 1 *.) + ^ (S <*«• + 1 


dS‘. 


(10) 


Squaring gives 






2 km 

PiPr 


£ dsi 2 “(■ S S *^^2 "I" £ + £ dsi 2 q1s2 

11 11 11 11 


Qi2 

£ 

1 


^ Pi 

Qis Qi 


(T.ds.r + '^i'Eds^)^ 


Q\i 

£ 

1 


Qi 

£ 

1 


Qi 

£ 

1 


(11) 


On summing and dividing by the number of cases to get the expected values, 
the last three terms in the square brackets vanish. Using the relation where, 
in random sampling, the correlation between two variables is the same as the 
correlation between their means 


^12 


^SA 

QlJ O-lOTj 



Qii 


tXl <Ti 


's/ Qn • (3i2 


(12) 


gives 


1 _ <r 1 

— p 



+ 


2 km kui Qi 2 iti (Jt rn 

Pi Pi 


( 13 ) 


Standard error of momentum when the 'population shifts 

The best estimates of ci and crj are the standard deviations of the scores, Si 
and S2 , and the best estimate of nz is, strictly, the covariance of the common cases 
divided by the two sigmas. Unless the selection of Q12 out of Pi and Pi cur- 
tails the range in some way (i.e., Qn is not a random selection), then, except for 
sampling variation, o-i and ai are the same in the Q12 population as ip the Pi and 
Pi populations so that there is only a sampling discrepancy between the ratio 
above and the rn, the observed correlation between the Si and S2 scores in 
the <3 i 2 population. 


III. The generalized standard error 

The above standard error may be readily generalized. Any of the equations 
( 1 ) to ( 5 ) may be expressed as a simple linear sum of the products of a variable, 
S, and its appropriate constant, k. 


f = £" K ife/. 


( 14 ) 
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where / is any one of the concepts S, C, y, 4, it/ ox F defined by (1) to (5) 
and n is the number of surveys, or different S's involved, and i denotes each 
survey in turn from 1 to n. Thus where / means F, (5) becomes- 


Jp F' — kn Si + S 2 + kps S 3 + kpi S 4 

~ £ kpt Si 

t = ] 


(15) 


where 


kpi — 


k = Pi + -Pg + -Ps + P4 

2 y(4_3_2+l) ^(2-1) 


kpi 


Pj ~1~ P2 ~f~ Ps 4 - Pj 

2 3 ^( 4 — 3 — 21 - 1 ) 3 ^( 4 — 3 ) 


(16a) 

(16b) 


In the special ease when a force, F, has been determined from only three 
surveys using two consecutive periods, n = 3 and 


,, _ P1 + P2 + P3 
1.5 y(4_i) (Fj-n 

k = — P^^~P P^) (F(2-n + y(3-2)) 

1.5 y«-i) rc3-2) 7(2-1) 

1 Pi + P 2 + Ps 
1.5 y(3-„ 7(3-2, 


(16c) 

(16d) 

(16e) 


If the difference between two forces (or other functions, /) has been measured 
in either the same or in different populations and the signilcance of the differ- 
ence in terms of its standard error is desired, / of (14) can also denote that 
difference. 

JdF — Fa — Fb', ~ (17) 


It is only necessary to write the difference as a linear sum of products of S 
and k on the model of (2a) or (5) to get the Ic-values for that particular /. 

It IS now possible to write the standard error formula foi- fin a single gen- 
eralized form that covers all the concepts and their differences as defined in 
equations (1) to (5), (14) and (17). Observing that (14) is the general case for n 
surveys of the particular case (7a) where n = 2, it becomes evident, that on 
taking differentials, squaring, summing, and dividing the linear sum of the n 
terms of (14) there results terms of which there arc n that are variances 

(times constants) of the sort and — — are different terms each occurnng 

JT ji 


twice that are covariances (times constants) of the sort 


kkQ(r<rr 


Prom these 
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rough considerations as well as from rigorous derivation, the generalized standard 
error of (14) is found to be: 


~Zj p;f^. > 

The generalized standard error. 


(18) 


Where i and j denote each of the n surveys in turn. There will thus be 
terms to be summed— the number of combinations of i with j including the 
cases where i = j. 

The derivation of (18) as well as its computation from data and its inter- 
pretation in special eases can all be made clearer by arranging the terms in a 
square array as follows : 



i 

1 

2 


n 

3 

Coefficients 

kfiffi 

k/ilTi 


kfn ffn 

i 



Pi 


Pn 

1 

hfi<xi 

Si 

Qiirn 


Qln^n 


Pi 

Bo 

( ) 


( ) 

9 


Qiirn 

m 


Qzn 


ft 

( ) 

DB 


( ) 

i 

• b 

r 

H 



n 


Qln T'ln 

QinVin 


Pn 

Pn 

( ) 

( ) 


( ) 


To get a-f write the computed values of the coefficients -p as captions of rows 

and of columns and write each computed Qr value in its appropriate cell, noting 
that in the main diagonal cells the self-correlations are unities and the popu- 
lation common to both column and row surveys, is the entire population 
of that survey as Qi,- = P, when i = j. Thus Qu = Pi . Next in each cell's 
parenthesis enter the product of three factors, namely: a) the cell Qr term, 
b) the column coefficient, and c) the row coefficient. The sum of these products 
in the parentheses, n® in number, is v® of (18) . 

From the above square array it becomes clear that whenever in (17) the 
difference of two observed forces, or other functions, is derived from different 
populations the Q between these populations is zero so that the entire product 
terms in those cells vanish. Thus in the very simplest and familiar case of 
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comparing two means from different populations, n = 2, Qu = 0, ik = 1, and 
(18) reduces to the usual sum of the two variances of the two means 

2 2 

(T^ difference in means == ^ + ^ (19) 

Pi Ps 


IV. Some special cases 

It'should be observed that the above formulae for the standard errors when 
P shifts all become identical with the simpler formulae previously derived for 
the case of a constant P. In this case, every Qp, = P^, = p^ and in the square 
array (in addition to fc’s which no longer involve an average PX the Q or P of 
the cells and the P's in the row coefficients, may be omitted as they cancel each 
other out. 

Another special but very frequent case is where the social change is not 
given in terms of a difference in means, Si and S} , but in terms of a difference 
in percentages, as when a literacy rate rises from 30% to 40%. A percentage 
can be viewed as a mean of a two-category, aJl-or-none, presen t-or-absent 
variable such as: A, non-A (foreign or native bom, literate or illiterate, etc), 
where A is assigned a value of 1 and non-A a value of 0. Then the sum of the 
values of A, each times its frequency, divided by the population is both a pro- 
portion and a mean. Its standard error in the percentage, p, form of expres- 
sion is then equal to it in the mean form: 


p Vl-OO — p 

ffp = — = <TS 

Vp 


IT. 

Vp 


^where s = 1 or 0 and 



( 20 ) 


60 that where S, in (14) is a percent p(1.00 — p) should be substituted for Vi 
(and ffg) in (18), In this case the appropriate formula to use for getting r<, 
in (18) depends on the nature of the distribution of the variable that is expressed 
in percentage form. If the distribution is normal, tetrachoric r may be ap- 
propriate, while if the <8 in percentage form is from a two point, distribution, 
r from a four fold point surface may be appropriate. 

In all the above cases the usual interpretation of the significance of / in respect 
to sampling errors may be used in entering a normal probability table with a 
given (T/ from (18) and reading the probability of such a/ occurring by chance.* 

For a numerical illustration of this formula (18), consider the case of two 
villages, the statistical significance of whose momentums of a social change are 
to be determined. The data are from a study^ of Syrian villages where an 


* Mr, Wilks comments here that, “there is a more exact and rigorous test for comparing 
the two sets of S’b which enter into a pair of M’s or F’a which involves some recent statisti- 
cal theory but it is doubtful if the extra refinement is worth while at this stage of soci- 
ometrio development.” 
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itinerant Health Clinic in two years changed the average hygienic status of the 
families in each village by amounts of score (on a scale of 1 to 1000 points, 
devised for this study) as indicated in the table below. 


Mean score in 1931 = Si = 

“ “ “ 1933 ='& = 

Population (families) in 1931 = Pi = 

“ “ “ 1933 = P 2 = 

Standard deviation of scores in 1931 = o-j = 

“ “ “ " “ 1933 = 0-2 = 

Families common to both censuses = Q 12 — 
Correlation of scores from the 2 dates = ri 2 = 
fCMl = — (Pi -p F*2)/2 F (2-1) = 

fcji/2 = — k^fl = 
fcAfi<ri/Pi = 

= 

QnTi2 = 

= 

Momentum = M 2^1 
Significance ratio 


Village A 
253 
304 
46 
40 
54 
58 
40 

00 

- 21.6 
21.5 
-25 24 
31.17 
0 

261 
1,097 
' 4.2 


* The calculation of this a by (18) may be illustrated in detail: 


Village B 
321 
528 
46 
32 
39 
70 
32 

.19 
-19.6 
19.5 
-16 53 
42.65 
6.08 
249* 
4,037 
16.2 


Village B 


Coefficients, ^ — > 

1 

2 

-16.53 

42.65 

1 

-16.53 

46 (= Pi) 
(12,571) 

6 08 (= Qr) 
(-4,286) 

2 

42 65 

6.08 (= Qr) 
(-4,286) 

32 (= P2) 
(58,208) 


2( ) = 62,207 

_ 2, 

~ O'jtf(s-l) 

"■"(j-i) = 249 


The momentum of the movement towards' improved hygiene achieved in 
village A is 4.2 times its standard error, while that of village B is 16.2 times 
its standard error. The excess momentum of village A over village B is 


8.1 = 


j times the standard error of their difference in momenta. Since 

all three of these significance ratios are well over 3 the conclusion is that the 
observed momenta and difference of momenta are statistically significant and 
cannot reasonably be due to sampling fluctuations. It may be noted that the 
significance ratios for the amounts of this social change, the difference in mean 
scores, are in close agreement with the above figures, being 4.1 and 15.9 for 
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villages A and B respectively, instead of 4.2 and 16.2 as above These discrep- 
ancies of R .1 and .3 in the statistical significance of these social changes com- 
pared with the corresponding social momenta are accounted for by the fact 
that the shift in the size of the population is allowed for in our formula for 
the case of momenta and is not considered in the usual formula for the case of 
social change. 

A minimum of three measurements of one population is necessary to deter- 
mine a social force. To determine its standard error all the correlations must 
be secured between every pair of measurements, each correlation derived from 
the part of the total population that is common to that pair of measurements. 
Obviously the data as currently reported from surveys and censuses and statisti- 
cal bureaus do not meet these specifications. More rigorous analysis of social 
data and reporting of correlations in it is a prerequisite to the measurement of 
social forces and their significance. 

American University of Beirut, Syria. 



AN APPROXIMATION TO “STUDENT’S” DISTRIBUTION* 


By Walter A. Hendricks 
1. Introduction 

The function commonly known as “Student’s” distribution occupies a promi- 
nent position among the classic contributions to the field of statistics, not only 
for its intrinsic value but also for the stimulus which it gave to statistical re- 
search at the time of its discovery. 

The function, which may be written in the form, 

( 1 ) ■ dF. = 

gives the distribution of the ratio, g, of the estimated arithmetic mean, x, to the 
estimated standard deviation, s, for samples of n observations drawn from the 
normal universe specified by the arithmetic mean, zero, and the , standard 
deviation, o-. This function, together with a table of values ’of its integral was 
given by "Student.”®-’'’ 

In view of the fact that similar distributions were subsequently found by 
Fisher’ to arise in a larger variety of practical problems than was originally 
supposed, a table of values of a new integral was later given by "Student”” 
in which the distribution of a variable, t, defined by the relation, 

(2) t = {n - l)*g, 

rather than the distribution of z itself, was considered. Another table giving 
the distribution of t, in a form intended to be more convenient for use by re- 
search workers wishing to apply statistical methods to experimental data, was 
later given by Fisher,® 

The integration of functions of the type defined by equation (1) involves 
considerable labor, a fact which has been somewhat embarrassing to practical 
statisticians interested in the distributions of z and t for values of n larger than 
those included in the above-mentioned tables. The recent appearance of 
Tables of the Incomplete Beta-Function, prepared under the direction of 
Pearson,’ has considerably alleviated the difficulty, but the requirements of 
certain practical problems arc not easily satisfied even with the aid of these 
tables. Consequently, simple approximations to the distributions of z and t, 


*A thesis submitted to the Faculty of the Columbian College of The George Wash- 
ington University in part satisfaction of the requirements dor the degree of Master of 
Arts. 


210 



AN APPROXIMATION TO “sTUDENt’s” DISTRIBUTION 


211 


which will bo sufficiently accurate for moat practical purposes, should be of 
some interest. 

According to “Student,”** the distribution of z tends to approach a normal 
curve with a standard deviation of (n — 3)~^ for values of n greater than 10, 
However, Doming, and Birge* have recently suggested that the distribution 
tends to approach a normal curve with a standard deviation of (n ~ 1|)“* 

This thesis presents a simple approximation to the distribution of z, which 
can be readily extended to the di.stribution of t and which will give more accu- 
rate results than either of the above approximations. 

11. Approximation to the Distribution of Z 

The approximation presented here is based upon the assumption that, for 
large values of n, the distribution of s tends to approach a normal curve with the 

arithmetic mean, s, and the standard deviation, that is, 


dF, = -T-c as. 


Since the distribution of the estimated arithmetic mean, is known to be 
normal, with the standard deviation, we have for the joint distribution of 


and X'. 




i may be expressed in terms of n and c by the well-known relation, 


(5) ® 

in which the factor, c„ , is defined by the formula, 

2» r(4n) 

(6) " ni r[Kn - !)]■ 


If we write, in place of §, in equation (4) and make the transformation, 


X = sz, 


we have for the joint distribution of s and z: 




To find the distribution of z, all that is necessary is to write; 

)) ' dF. s dsj dz. 
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ill which; 


k = 




( 10 ) 


2W2 


a=~U^ + 2V, 

b = (z^ + 2)- 


The integral in brackets in equation (9) can be evaluated without any diffi- 
culty, Wchave; 


( 11 ) 




4'“ > , 

g-(as~6)- S ds = — r • 


Substituting this value in equation (9) and replacing fc, a, and b by the quan- 
tities which they represent, we obtain the following expression for the distribu- 
tion of z. 


( 12 ) 


dF, = ( 2 = -h 2)-3 <Iz. 


If we now' define a new variable, u, by the relation, 


(13) 


= 2nCn- 


•f 2' 


and make the appropriate substitutions in equation (12), we have, for the dis- 
tribution function of tt ; 


(14) 




Equation (14) ik obviously a normal curve with unit standard deviation. We 
have thus deduced the interesting fact that, for values of n sufficiently large so 
that the distribution of s may be represented by a normal curve, the quantity, 


2’‘nkn 


is distributed as a normal deviate with unit standard deviation. 


(2^ -p 2)i’ 

The accuracy of this approximation as compared with that of the approxima- 
tion suggested by “Student”® and that of the more recent approximation sug- 
gested by Demmg and Birge' may now be considered. As previously stated, 
the “Student” approximation is based on the assumption that the quantity, 
(a — 3)^z, is distributed. as a normal deviate with unit standard deviation for 
values of n greater than 10, while that suggested by Deming and Birge is based 
on the assumption that the quantity, (n — If) * 2 , is so distributed. 

Table 1* gives values of the integral, 7*, defined by 


(15) 


Tz ~ / (l -P 

- 1), I] U ^ ^ ^ 


dz, 


* All tables and charts to which reference la made are to be found in the Appendix 
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for the case, n = 10, together with the corresponding approximate values ob- 
tained by making use of the three approximations suggested by “Student,” 
Peming and Birge, and the present author, respectively. The exact values 
and those obtained by the “Student” approximation were derived from values 
calculated by “Student”® and given by Pearson.® All other data in the table 
were calculated by the present author. 

An inspection of Table 1 shows that the values of based on the approxima- 
tion presented in this thesis agree very well with the corresponding exact values. 
The agreement is better than that found in the case of either of the other two 
approximations. The Deming and Birge approximation gives better results 
than the “Student” approximation for values of z in the neighborhood of zero, 
but for other values of z the opposite is true. 


Ill, Approximation to the Distribution of t 

Since tables giving the distribution of the variable, t, have largely superseded 
those giving the distribution of z in practical statistical work, the feasibility of 
applying the above three approximations to the distribution of t is worthy of 
consideration. 

The variable, t, has already been defined in terms of n and z by equation (2). 
If, in equation (12), we make the transformation, 

(16) z = (n - l)H, 
we have, for the distribution function of t: 

(17) dF, = [p + 2(n - I)]"* dt. 

TT^ 


If we now define a variable, v, by the relation. 


(18) 


= 2nc„® 


t® 

t® + 2(n- 1)' 


we have, for the distribution function of v: 

(19) 

Equation (19) shows that, for values of n sufficiently large so that the distribu- 
tion of s may be represented by a normal curve, the quantity, 

is distributed as a normal deviate with unit standard deviation. On the other 
hand, if we assume with “Student” that, for large values of n, the quantity, 
(n — 3)^z, is normally distributed about zero with unit standard deviation, we 

should expect to find that the quantity, is also distributed as a normal 
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deviate with unit standard deviation If the Deming and Birge approximation 
to the distribution of z is assumed to be valid, we should expect to find that the 


i, is distributed as a 'normal deviate with unit standard 


deviation. 

To test the accuracy of each of these three approximations to the distribution 
of t, we may make use of the well-known table of values of t given by Fisher.® 
This table is so constructed that a value of t corresponding to a given number of 
“degrees of freedom” and a . given value of “P” may be read from the table, 
where P is defined by the relation. 


( 20 ) 


P = 1 - 


(7i-l)iB[Ka-l) 




The entries in the last line of the table, corresponding to an infinite number of 
“degrees of freedom,” are the deviates of a normal curve with unit standard 
deviation. 

To test the accuracy of the “Student” approximation, we may calculate the 
entries for a line of this table, corresponding to n — 1 “degrees of freedom,” by 

(n - 1)* 


multiplying the entries in the last line of the table by 


These approxi- 


(n - 3)1 ■ 

mate values of i may then be compared with the exact values given in the table. 
The accuracy of the Deming and Birge approximation may be tested in the same 
manner, except that in this case the entries in the last line of the table should be 

(ft - 1)1 


multiplied by 


To test the accuracy of the approximation given by 


(ft — 1^)1 

equation (19), we may calculate the values of t corresponding to n — 1 “degrees 
of freedom” by means of the relation. 


( 21 ) 


2(n — l)y® 

2ftC„® — D* ’ 


in which the entries in the last line of the table are to be taken as the values of v. 

Table 2 gives the exact values of t corresponding to the values of P given in 
Fisher’s table for n = 10, together with the approximate values calculated by 
means of each of the above three approximations. This comparison of the 
accuracies of the three approximations is equivalent to the comparisons pre- 
sented in Table 1 , The conclusions which may be drawn are in agreement with 
those which have already been drawn from that table. 

In order to test the behavior of each of the approximations for a larger value 
of ft, values of t corresponding to the different values of P were calculated for 
ft = 30. The results are presented in Table 3. The rank of each of the three 
approximations, with regard to accuracy, for ft = 30 is the same as for w = 10. 
Although all three give more accurate results for the larger value of ft, the 
superiority of the approximation presented in this thesis is quite apparent. 
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For extremely large values of r, all three approximations evidently tend to 
become one-hundred percent accurate, for the distribution of t tends to become 
normal as n is increased indefinitely. In the case of the "Student" and Deming 

and Birge approximations, the ratios, and obviously ap- 

proach unity, respectively, as n becomes very large. The approximate value of 
t given by equation (21) also tends to approach the normal deviate, «, as n is 
increased for we have: 


( 22 ) 


lira = 

n-»«» 


2(n - W 

St 255?^ 


n~*ao [_2?lCn^ — 


2v^ 


2ncJ — 


= lim 


2y* 


/* * _ 
t'n 


2n 


IV. Discussion 

The greater accuracy of the approximation to the distribution of z presented 
in this thesis apparently can not be explained by the hypothesis that the distribu- 
tion of s becomes normal more rapidly than the distribution of 2 as n is increased. 
Table 4 presents values of the ordinates of the normal curve with unit standard 
deviation, together with the corresponding ordinates of the exact distributions 

of the quantities, (s — §), (n — 3)h, (n — 1|)*2, and 2hk„ * 




(z* + 2)‘’ 


for 


n — 10. Although the distribution of (s — s) seems to follow the normal 

IT 

curve more closely than does the distribution of (n — 3)% the opposite seems 
to be true in the case of the distribution of (n — H )^2 The distribution of 
2 


2tnlc„ 


, , however, follows the normal curve quite closely. 


(2® -|- 2)* 

The behavior of these distributions for n = 10 can be observed more easily 

in Figures 1, 2, and 3 in which the frequency curves of (s — s), (n — 3)^2, 

c 

and (n — are respectively plotted together with the normal curve with 

unit standard deviation. The frequency curve of 2Hhn 2 )! 

plotted because of the fact that this curve follows the normal curve so closely 
that the two curves could not be distinguished when plotted on the scale u?ed in 
the other three charts. ' 

The most reasonable conclusion which can be drawn from Table 4 and Figures 
1, 2, and 3 is that the departure of the exact distribution of s from the normal 
curve has very little effect in destroying the normality of the distribution 
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V. Values of the Factor, c„ 


For the practical application of the approximations to the distributions of 
z and t presented in this thesis, a table of values of the factor, c„ , is required. 
Values of this factor, for values of n as high as 100, have been tabulated by 
Pearson^’ ® and by Shewhart.® For values of n greater than 100, Cn may be 
calculated accurately to at least five significant figures by the following relation, 
given by Pearson* and by Deming and Birge*; 


(23) 


Cn — 1 


^ ‘7__ 

4n 32n“‘ 


Table 5 presents values of Cn for some large values of n, calculated by the 
present author. For values of n not included in this table, c„ may be calculated 
by means of equation (23) just as rapidly as by interpolation in the table. 


VI. Summary and Conclusions 

For values of n sufficiently large so that the distribution of s may be repre- 
sented by a normal curve, the quantities. 


2^iCn 


(z^ + 2 )» 


and 2*n*c. 


[i2+2(n -!)]»’ 


are distributed as normal deviates with unit standard deviation. The results 
obtained by assuming a normal distribution of s are more accurate than those 
obtained by assuming that either (n — 3)^2 or (n — li)h is distributed as a 
normal deviate with unit standard deviation. For extremely large values of n, 
the distribution of each of the above quantities tends to approach a normal 
curve with a mean of zero and unit standard deviation. 
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VIII. Appendix 
TABLE 1 


Exact values of h and approximate, values, derived from tables of the normal 
prohahlity integral, for n = 10 


2 

I. 

Exact value 

“Student” 

approximation 

Deming & Birge 
approximation 

Hendricks 

approximation 

-2.0 

.0001 

0000 

0000 

.0004 

-1 8 

.0002 

0000 

.0000 

.0006 

-1.6 

.0005 

.0000 

0000 

0010 

-1.4 

.0011 

.0001 

,0000 

.0018 

-1.2 

.0029 

.0007 

0002 

0038 

-1 0 

.0075 

0041 

.0018 

.0086 

™ 8 

.0199 

0171 

0098 

.0211 

- ,6 

.0527 

.0562 

.0401 

0535 

- .4 

.1304 

.1448 

" 1218 

.1307 

- .2 

.2816 

2984 

2799 

2817 

0 

.5000 

5000 

5000 

.5000 

+ .2 

7184 

.7016 

7201 

.7183 

*}“ 4 

8696 

8552 . 

.8782 

8693 

■ 6 

.9473 

.9438 

.9599 

9465 

+ .8 

.9801 

9829 

,9902 

9789 

•fl.O 

.9925 

.9959 

9982 

.9914 

+1.2 

.9971 

.9993 

.9998 

.9962 

+1.4 

.9989 

.9999 

1.0000 

.9982 

' +1 . 6 

.9995 

1 0000 

1 0000 

9990 

+1 . 8 

.9998. 

1 0000 

1 0000 

.9994 

+2.0 

.9999 

1.0000 

1.0000 

.9996 
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TABLE 2 


Exact values of t corresponding to different values of P and approximate values, 
derived from normal deviates, for n = 10 


p 


t 



Exact value 

“Student” 

approximation 

Deraing & Birge 
approximation 

Hendricks 

approximation 

.90 

.129 

.142 

.129 

129 

.80 

261 

.287 

.261 


.70 

.398 

.437 

,396 

.398 

.60 

.543 

.696 

.540 

544 

.50 

.703 

.765 

.694 


.40 

,883 

.954 

.866 

.884 

.30 

1.100 

1.175 

1.066 


.20 

1.383 

1.453 

1 319 

1.386 

.10 

1.833 

1.865 

1.693 

1.844 

,05 

2 262 

2.222 

2 017 

2 290 

.02 

2.821 

2 638 

2 394 

2.896 

.01 

3.250 

2.921 


3.389 


TABLE 3 


Exact values of t corresponding to different values of P and approximate values, 
derived from normal deviates, for n = 30 


P 

i 

Exact value 

“Student” 

approximation 

Deming & Birge 
approximation 

Hendricks 

approximation 

.90 

.127 

.130 

.127 

.127 

.80 

.256 

.263 

.256 

.256 

.70 

.389 , 

.399 

.389 

.389 

.60 

530 

.543 

.529 

.530 

.50 

.683 

699 

680 

.683 

.40 

854 

.872 

849 

.854 

.30 

1 055 

1 074 

1.045 

1,055 

.20 

1.311 

1,328 

1.293 

1,312 

.10 

1.699 

1.705 

1,659 

1.700 ' 

.05 

2.045 

2.031 

1.977 

2.047 

.02 

2.462 

2 411 

2,347 

2.466 

,01 

2.756 

2.670 

2.598 

2.764 
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TABLE 4 

Ordinates of the normal curve with unit standard deviation and ordinates of the 

2in^ 

exact distrihution functions of (s — s), {n ~ 3)^2, (re — 15 )^ 2 , and 


2in^Cn 


<r 

Z 


(22 4- 2)i 


for n = 10 


Deviation from 
mean 

Ordinates of distribution function 

Normal 

deviate 

j-v 

(n - 3)12 

in - li)i 2 

>>ui„ * 


" ^'‘(*’+3)1 

- 3.0 


.0006 

.0071 

0108 

0034 

- 2.5 


.0085 

.0181 

.0254 

.0156 

- 2.0 


.0454 

.0459 

.0581 

.0544 

-1 5 

.1295 

.1356 

.1092 

.1234 

.1306 

-1.0 


.2663 

2256 

.2290 

.2426 

- 5 


3751 

.3692 

.3454 

.3522 

.0 

.3989 

3999 

.4400 

.3991 

.3990 

-j" . 5 


3343 

.3692 

3454 

.3522 

0 


2245 

.2256 

.2290 

2426 

+ 1.5 

1295 

.1233 

.1092 

.1234 

1306 

+2 0 



0459 

.0581 

,0544 

+ 2.5 

mBM 


.0181 

.0254 

.0156 

+ 3.0 

■B 


0071 

.0108 

, 

.0034 


n 


100 

150 

200 

250 

300 

350 

400 

450 

600 

600 

700 

800 


TABLE 5 


Values of c„ for large values of n 


Cn 


99248 

,99499 

.99624 

.99700 

.99750 

99786 

.99812 

.99833 

.99850 

.99875 

,99893 

99906 


n 


900 

.99917 

1000 

.99925 

2000 

,99962 

3000 

99975 

4000 

,99981 

5000 

.99985 

10000 

99992 

20000 

.99996 

30000 

.99997 

40000 

.99998 

50000 

.99998 

100000 

.99999 
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